
Human DNase I 






LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



1039 bp mRNA 
I mRNA, complete



PRI 



06-MAR-1995 



cds 



181623 



cDNA to mRNA. 



sig_peptide



CDS 



HUMDNASEI 
Human DNase 
M55983 

M55983.1 GI 
DNase I . 
Human pancreas,
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1039) 

Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L.
Recombinant human DNase I reduces the viscosity of cystic fibrosis 
sputum 

Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 
91067672 

Location/Qualifiers 
1..1039

/organism="Homo sapiens"
/db_xref="taxon:9606"
/clone="hDNase-18-l"
/tissue_type="pancreas"
160..225
/gene="DNase I"
160..1008
/gene="DNase I"
/codon_start=1
/product="DNase I"
/protein_id="AAA63170.1"
/db_xref="GI:181624"

/translation="MRGMKLLGALLALAALLQGAVSLKIAAFNIQTFGETKMSNATLV
SYIVQILSRYDIALVQEVRDSHLTAVGKLLDNLNQDAPDTYHYVVSEPLGRNSYKERY
LFVYRPDQVSAVDSYYYDDGCEPCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAPG
DAVAEIDALYDVYLDVQEKWGLEDVMLMGDFNAGCSYVRPSQWSSIRLWTSPTFQWLI
PDSADTTATPTHCAYDRIVVAGMLLRGAVVPDSALPFNFQAAYGLSDQLAQAISDHYP
VEVMLK"



gene 



mat_peptide



160. .1008 
/gene="DNase I" 
226. .1005 
/gene="DNase I" 
/product="DNase I"



BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 



226 a 



305 c 



282 g 



226 t 



tcctgcacag gcagtgcctt 

ttttctttaa gcagcaaaag 

cattctcgtc atctctgagg 

ggggcgctgc tggcactggc 

ttcaacatcc agacatttgg 

gtgcagatcc tgagccgcta 

actgccgtgg ggaagctgct 

gtggtcagtg agccactggg 

cctgaccagg tgtctgcggt 

aacgacacct tcaaccgaga 

agggagtttg ccattgttcc 

661 gctctctatg acgtctacct 

721 atgggcgact tcaatgcggg 

781 ctgtggacaa gccccacctt 

841 cccacgcact gtgcctatga 

901 gttcccgact cggctcttcc 

961 gcccaagcca tcagtgacca 

1021 cacaccagtt gaactgcag 



gaagtgcttc ttcagagacc tttcttcata gactactttt 
gagaaaattg tcatcaaagg atattccaga ttcttgacag 
acatcaccat catctcagga tgaggggcat gaagctgctg 
ggccctactg cagggggccg tgtccctgaa gatcgcagcc 
ggagaccaag atgtccaatg ccaccctcgt cagctacatt 
tgacatcgcc ctggtccagg aggtcagaga cagccacctg 
ggacaacctc aatcaggatg caccagacac ctatcactac 
acggaacagc tataaggagc gctacctgtt cgtgtacagg 
ggacagctac tactacgatg atggctgcga gccctgcggg 
gccagccatt gtcaggttct tctcccggtt cacagaggtc 
cctgcatgcg gccccggggg acgcagtagc cgagatcgac 
ggatgtccaa gagaaatggg gcttggagga cgtcatgttg 
ctgcagctat gtgagaccct cccagtggtc atccatccgc 
ccagtggctg atccccgaca gcgctgacac cacagctaca 
caggatcgtg gttgcaggga tgctgctccg aggcgccgtt 
ctttaacttc caggctgcct atggcctgag tgaccaactg 
ctatccagtg gaggtgatgc tgaagtgagc agcccctccc 



// 



Human DNase I construct 



LOCUS 

DEFINITION 
site 

ACCESSION 
NID 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



sig_peptide 



CDS 



gene 



mat_peptide



BASE COUNT 
ORIGIN 



MHDNASE.DN 783 bp mRNA PRI 06-MAR-1995

Human DNase I mRNA, complete cds, Mature sequence modified to remove NarI

M55983 
g181623
DNase I.

Human pancreas, cDNA to mRNA.
Homo sapiens

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
1 (bases 1 to 1039)

Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L.
Recombinant human DNase I reduces the viscosity of cystic fibrosis
sputum

Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990)
91067672

Location/Qualifiers 
1..1039

/organism="Homo sapiens"
/db_xref="taxon:9606"
/clone="hDNase-18-l"
/tissue_type="pancreas"
160..225
/gene="DNase I"
160..1008
/gene="DNase I"
/codon_start=1
/product="DNase I"
/db_xref="PID:g181624"

/translation="LKIAAFNIQTFGETKMSNATLVSYIVQILSRYDIALVQEVRDSH
LTAVGKLLDNLNQDAPDTYHYVVSEPLGRNSYKERYLFVYRPDQVSAVDSYYYDDGCE
PCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAPGDAVAEIDALYDVYLDVQEKWGL
EDVMLMGDFNAGCSYVRPSQWSSIRLWTSPTFQWLIPDSADTTATPTHCAYDRIVVAG
MLLRGAVVPDSALPFNFQAAYGLSDQLAQAISDHYPVEVMLK"
160..1008
/gene="DNase I"
226..1005
/gene="DNase I"
/product="DNase I"
168 a 236 c 220 g 159 t 



// 



1 CTGAAGATCG 
61 CTCGTCAGCT 
121 AGAGACAGCC
181 GACACCTATC 
241 CTGTTCGTGT 
301 TGCGAGCCCT 
361 CGGTTCACAG 
421 GTAGCCGAGA 
481 GAGGACGTCA
541 TGGTCATCCA
601 GACACCACAG
661 CTCCGAGGGG 
721 CTGAGTGACC 
781 TGA 



CAGCCTTCAA 
ACATTGTGCA 
ACCTGACTGC 
ACTACGTGGT 
ACAGGCCTGA 
GCGGGAACGA 
AGGTCAGGGA 
TCGACGCTCT 
TGTTGATGGG 
TCCGCCTGTG 
CTACACCCAC 
CCGTTGTTCC 
AACTGGCCCA 



CATCCAGACA 
GATCCTGAGC 
CGTGGGGAAG 
CAGTGAGCCA 
CCAGGTGTCT 
CACCTTCAAC 
GTTTGCCATT 
CTATGACGTC 
CGACTTCAAT 
GACAAGCCCC 
GCACTGTGCC 
CGACTCGGCT 
AGCCATCAGT 



TTTGGGGAGA 
CGCTACGACA 
CTGCTGGACA 
CTGGGACGGA 
GCGGTGGACA 
CGAGAGCCAG 
GTTCCCCTGC 
TACCTGGATG 
GCGGGCTGCA 
ACCTTCCAGT 
TATGACAGGA 
CTTCCCTTTA 
GACCACTATC 



CCAAGATGTC 
TCGCCCTGGT 
ACCTCAATCA 
ACAGCTATAA 
GCTACTACTA 
CCATTGTCAG 
ATGCGGCCCC 
TCCAAGAGAA 
GCTATGTGAG 
GGCTGATCCC 
TCGTGGTTGC 
ACTTCCAGGC 
CAGTGGAGGT 



CAATGCCACC 
CCAGGAGGTC 
GGACGCACCA 
GGAGCGCTAC 
CGATGATGGC 
GTTCTTCTCC 
GGGGGACGCA 
ATGGGGCTTG 
ACCCTCCCAG 
CGACAGCGCT 
AGGGATGCTG 
TGCCTATGGC 
GATGCTGAAG 



Fig. 2(A) 
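
The feature coordinates in the two DNase I records above can be cross-checked with simple span arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive as in GenBank flat files; the names `FEATURES` and `span_length` are illustrative and do not come from the source.

```python
# Feature spans (start, end), 1-based inclusive, copied from the FEATURES table
# of the M55983 record. The dictionary and helper names are illustrative only.
FEATURES = {
    "sig_peptide": (160, 225),
    "mat_peptide": (226, 1005),
    "CDS": (160, 1008),
}


def span_length(start, end):
    """Number of bases in an inclusive 1-based span."""
    return end - start + 1


lengths = {name: span_length(*span) for name, span in FEATURES.items()}
codons = {name: n // 3 for name, n in lengths.items()}

# The CDS is the signal peptide, the mature peptide and one stop codon.
assert lengths["CDS"] == lengths["sig_peptide"] + lengths["mat_peptide"] + 3
# The 783 bp modified record corresponds to the mature peptide plus its stop codon.
assert lengths["mat_peptide"] + 3 == 783

print(lengths)  # {'sig_peptide': 66, 'mat_peptide': 780, 'CDS': 849}
print(codons)   # {'sig_peptide': 22, 'mat_peptide': 260, 'CDS': 283}
```

Under these assumptions the 22-residue signal peptide and 260-residue mature peptide account for the whole coding region, which is consistent with the 783 bp length quoted for the modified mature sequence.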



LOCUS 

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 
frag 

frag 

source 



PAS155 GB. 



858 BP SS-DNA 



SYN 



29-AUG-2000



sig_peptide 
CDS 



Location/Qualifiers 
10..75

/note="160 to 225 of Untitled1"
<10..>75

/note="1 to 1039 of M55983.DNA [Split]"
<10..>75

/organism="Homo sapiens"
/db_xref="taxon:9606"
/clone="hDNase-18-l"
/tissue_type="pancreas [Split]"
10..75

/gene="DNase I"
10..>75
/gene="DNase I"
/codon_start="1"
/product="DNase I"
/db_xref="PID:g181624"

/translation="MRGMKLLGALLALAALLQGAVSLKIAAFNIQTFGETKMSNATLV

SYIVQILSRYDIALVQEVRDSHLTAVGKLLDNLNQDAPDTYHYVVSEPLGRNSYKERY

LFVYRPDQVSAVDSYYYDDGCEPCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAPG

D... [Split]"
10..>75

/gene="DNase I [Split]"
76..858

/note="1 to 783 of mod humanDNAsel"
76..858

/note="1 to 72 of 104linker"
join(76..>129,<131..147)
/note="1 to 72 of 103linker [Split]"
join(76..>126,<127..>129,<131..147)
/note="1 to 78 of 102linker [Split]"
177 A 260 C 251 G 170 T 0 OTHER



gene 
frag 
frag 
frag 
frag 



BASE COUNT 
ORIGIN 

1 GCCGCCACCA TGAGGGGCAT GAAGCTGCTG GGGGCGCTGC TGGCACTGGC
61 CAGGGGGCCG TGTCCCTGAA GATCGCAGCC TTCAACATCC AGACATTTGG 
121 ATGTCCAATG CCACCCTCGT CAGCTACATT GTGCAGATCC TGAGCCGCTA 
181 CTGGTCCAGG AGGTCAGAGA CAGCCACCTG ACTGCCGTGG GGAAGCTGCT 
241 AATCAGGACG CACCAGACAC CTATCACTAC GTGGTCAGTG AGCCACTGGG 
301 TATAAGGAGC GCTACCTGTT CGTGTACAGG CCTGACCAGG TGTCTGCGGT 
361 TACTACGATG ATGGCTGCGA GCCCTGCGGG AACGACACCT TCAACCGAGA 
421 GTCAGGTTCT TCTCCCGGTT CACAGAGGTC AGGGAGTTTG CCATTGTTCC
481 GCCCCGGGGG ACGCAGTAGC CGAGATCGAC GCTCTCTATG ACGTCTACCT
541 GAGAAATGGG GCTTGGAGGA CGTCATGTTG ATGGGCGACT TCAATGCGGG 
601 GTGAGACCCT CCCAGTGGTC ATCCATCCGC CTGTGGACAA GCCCCACCTT 
661 ATCCCCGACA GCGCTGACAC CACAGCTACA CCCACGCACT GTGCCTATGA 
721 GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT GTTCCCGACT CGGCTCTTCC 
781 CAGGCTGCCT ATGGCCTGAG TGACCAACTG GCCCAAGCCA TCAGTGACCA
841 GAGGTGATGC TGAAGTGA 

// 



GGCCCTACTG 
GGAGACCAAG 
CGACATCGCC 
GGACAACCTC 
ACGGAACAGC 
GGACAGCTAC 
GCCAGCCATT 
CCTGCATGCG 
GGATGTCCAA 
CTGCAGCTAT 
CCAGTGGCTG 
CAGGATCGTG 
CTTTAACTTC 
CTATCCAGTG 



Fig. 2(B) 






LOCUS 

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 

ORGANISM 
REFERENCE 

AUTHORS 

TITLE 

JOURNAL 
COMMENT 
FEATURES 

SITES 



pAS6 - light chain 



HMFG1LC2.D 721 bp DNA 

HUMANISED HMFG1 LIGHT CHAIN Vnp LEADER 



18-AUG-1998 



1 (BASES 1 TO 342) 
VERHOEYEN ET AL 

CONSTRUCTION OF RESHAPED HMFG1 ETC 
IMMUNOL. (1993) :78, 364-370 
SCANNED IN FROM JOURNAL 



This is the sequence of the HMFG1 light chain gene with the 
Vnp leader sequence attached. Translate from 
residue 1. Note residue 399 is T > A in all clones leading 
to R133 silent mutation (T in Verhoeyen paper) 



BASE COUNT 
ORIGIN 

197 a



202 c 



182 g 



140 t 






1 ATGGGATGGA 
61 ATCCAGATGA
121 ACCTGTAAGT 
181 TACCAGCAGA 
241 TCTGGTGTGC 
301 AGCAGCCTCC 

361 CGGACGTTCG
421 TTCATCTTCC
481 CTGAATAACT 
541 TCGGGTAACT 
601 AGCAGCACCC 
661 GTCACCCATC 
721 A 



GCTGTATCAT CCTCTTCTTG 
CCCAGAGCCC AAGCAGCCTG 
CCAGTCAGAG CCTTTTATAT 
AGCCAGGTAA GGCTCCAAAG 
CAAGCAGATT CAGCGGTAGC 
AGCCAGAGGA CATCGCCACC 
GCCAAGGGAC CAAGGTGGAA 
CGCCATCTGA TGAGCAGTTG 
TCTATCCCAG AGAGGCCAAA 
CCCAGGAGAG TGTCACAGAG 
TGACGCTGAG CAAAGCAGAC 
AGGGCCTGAG CTCGCCCGTC 



GTAGCAACAG 
AGCGCCAGCG 
AGTAGCAATC 
CTGCTGATCT 
GGTAGCGGTA 
TACTACTGCC 
ATCAAACGAA 
AAATCTGGAA 
GTACAGTGGA 
CAGGACAGCA 
TACGAGAAAC 
ACAAAGAGCT



CTACAGGTGT 
TGGGTGACAG 
AAAAGATCTA
ACTGGGCATC 
CCGACTTCAC 
AGCAATATTA 
CTGTGGCTGC 
CTGCCTCTGT 
AGGTGGATAA 
AGGACAGCAC 
ACAAAGTCTA 
TCAACAGGGG 



CCACTCC GAC 
AGTGACCATC 
CTTGGCCTGG 
CACTAGGGAA 
CTTCACCATC 
TAGATATCCT 
ACCATCTGTC 
TGTGTGCCTG 
CGCCCTCCAA 
CTACAGCCTC 
CGCCTGCGAA 
AGAGTGTTAG 



// 



Fig. 3(A) 
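
The COMMENT in the light chain record above states that residue 399 is T > A and that the change is silent at R133. A minimal sketch of that reasoning follows, assuming translation starts at residue 1 as the comment says; the codon `CGA` is read from bases 397-399 of the ORIGIN block, and the helper name `codon_of` is illustrative.

```python
# Arginine codons in the standard genetic code.
ARG_CODONS = {"CGT", "CGC", "CGA", "CGG", "AGA", "AGG"}


def codon_of(residue, frame_start=1):
    """Return (codon number, position within codon) for a 1-based nucleotide residue."""
    offset = residue - frame_start
    return offset // 3 + 1, offset % 3 + 1


codon_number, position = codon_of(399)

# Codon 133 as it appears in the clones (bases 397-399), and the version
# with T at residue 399 that the comment attributes to the Verhoeyen paper.
in_clones = "CGA"
in_paper = in_clones[:2] + "T"

print(codon_number, position)                               # 133 3
print(in_clones in ARG_CODONS and in_paper in ARG_CODONS)   # True
```

Residue 399 falls on the third (wobble) position of codon 133, and both CGA and CGT encode arginine, which is why the substitution leaves the protein unchanged.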




LOCUS 
DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 
frag 

frag 

frag 

frag 

BASE COUNT 



HHMFG1KLC 



730 BP SS-DNA 



SYN 



29-AUG-2000 



198 A 



Location/Qualifiers
10..730

/note="1 to 721 of hHMFG1light chain"
10..730

/note="1 to 72 of 104linker"
join(10..>63,<65..81)
/note="1 to 72 of 103linker [Split]"
join(10..>60,<61..>63,<65..81)
/note="1 to 78 of 102linker [Split]"

208 C 184 G 140 T 0 OTHER 






ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC 

61 CACTCCGACA TCCAGATGAC CCAGAGCCCA AGCAGCCTGA GCGCCAGCGT 

121 GTGACCATCA CCTGTAAGTC CAGTCAGAGC CTTTTATATA GTAGCAATCA 

181 TTGGCCTGGT ACCAGCAGAA GCCAGGTAAG GCTCCAAAGC TGCTGATCTA

241 ACTAGGGAAT CTGGTGTGCC AAGCAGATTC AGCGGTAGCG GTAGCGGTAC 

301 TTCACCATCA GCAGCCTCCA GCCAGAGGAC ATCGCCACCT ACTACTGCCA 

361 AGATATCCTC GGACGTTCGG CCAAGGGACC AAGGTGGAAA TCAAACGAAC 

421 CCATCTGTCT TCATCTTCCC GCCATCTGAT GAGCAGTTGA AATCTGGAAC 

481 GTGTGCCTGC TGAATAACTT CTATCCCAGA GAGGCCAAAG TACAGTGGAA 

541 GCCCTCCAAT CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA 

601 TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 

661 GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA CAAAGAGCTT 
721 GAGTGTTAGA 

// 



TACAGGTGTC 
GGGTGACAGA 
AAAGATCTAC
CTGGGCATCC 
CGACTTCACC 
GCAATATTAT 
TGTGGCTGCA 
TGCCTCTGTT 
GGTGGATAAC 
GGACAGCACC 
CAAAGTCTAC 
CAACAGGGGA 



Fig. 3(B) 



HMFG-1 light chain with Vnp Leader (shaded) 



MGWSCIILFLVATATGVHSDIQMTQSPSSLSASVGDRVTITCKSSQSL

LYSSNQKIYLAWYQQKPGKAPKLLIYWASTRESGVPSRFSGSGSGT 

DFTFTISSLQPEDIATYYCQQYYRYPRTFGQGTKVEIKRTVAAPSVFI 

FPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNALQSGNSQESV

TEQDSKDSTYSLSSTLTLSKADYEKHKVYACEVTHQGLSSPVTKSFN 

RGEC 




LOCUS 

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 

ORGANISM 
REFERENCE 

AUTHORS 

TITLE 

JOURNAL 
COMMENT 
FEATURES 
FEATURES 

SITES 
BASE COUNT 
ORIGIN 



pAS6 - heavy chain 






HHMFG1HC.D 1404 bp DNA

HUMANISED HMFG1 heavy chain 
HHMFG1H 



14-AUG-1998 



VERHOEYEN ET AL 

CONSTRUCTION OF RESHAPED HMFG1 etc 

IMMUNOL. (1993) :78, 364-370 

VH domain SCANNED IN FROM JOURNAL 



Note AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A)
Residue 963 is G > T leading to silent mutation in all clones

333 a 439 c 379 g 253 t






1 

61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 






LEADER 



ATGGGATGGA
GTGCAGCTGG 
TGCAAGGCTT 
GGAAAGGGCC 
GAGAAGTTCA 
GAGCTCAGCA 
TTTGCCTGGT 
AAGGGCCCAT 
GCCCTGGGCT 
GGCGCCCTGA 
TCCCTCAGCA 
AACGTGAATC 



721 GACAAAACTC



781 TTCCTCTTCC
841 TGCGTGGTGG 
901 GGCGTGGAGG 
961 CGTGTGGTCA 
1021 TGCAAGGTCT 
1081 GGGCAGCCCC 
1141 AACCAGGTCA 
1201 TGGGAGAGCA 
1261 GACGGCTCCT
1321 AACGTCTTCT 
1381 CTCTCCCTGT 



GCTGTATCAT 
TGCAGTCTGG 
CTGGCTACAC 
TCGAGTGGGT 
AGGGCCGAGT 
GCCTGAGGTC 
TTGCTTACTG 
CGGTCTTCCC 
GCCTGGTCAA 
CCAGCGGCGT 
GCGTGGTGAC 
ACAAGCCCAG 
ACACATGCCC



CCTCTTCTTG 
GGCAGAGGTG 
CTTCAGTGCC 
CGGAGAGATT 
GACAGTCACT 
TGAGGACACA
GGGCCAAGGG 
CCTGGCACCC 
GGACTACTTC 
GCACACCTTC 
CGTGCCCTCC 
CAACACCAAG 
ACCGTGCCCA 



CCCCAAAACC 
TGGACGTGAG 
TGCATAATGC 
GCGTCCTCAC 
CCAACAAAGC 
GAGAACCACA 
GCCTGACCTG 
ATGGGCAGCC 
TCTTCCTCTA 
CATGCTCCGT 
CTCCGGGTAA 



CAAGGACACC 
CCACGAAGAC 
CAAGACAAAG 
CGTCCTGCAC 
CCTCCCAGCC 
GGTGTACACC 
CCTGGTCAAA 
GGAGAACAAC
CAGCAAGCTC 
GATGCATGAG
ATGA 



GTAGCAACAG 
AAAAAGCCTG 
TACTGGATAG 
TTACCTGGAA 
AGAGACACAT 
GCCGTCTATT 
ACTCTGGTCA 
TCCTCCAAGA 
CCCGAACCGG 
CCGGCTGTCC 
AGCAGCTTGG 
GTGGACAAGA 
GCACCTGAAC



CTCATGATCT 
CCTGAGGTCA 
CCGCGGGAGG 
CAGGACTGGC 
CCCATCGAGA
CTGCCCCCAT 
GGCTTCTATC 
TACAAGACCA 
ACCGTGGACA 
GCTCTGCACA 



CTACAGGTGT 
GGGCCTCAGT 
AGTGGGTGCG 
GTAATAATTC 
CCACAAACAC 
ACTGTGCAAG 
CAGTCTCCTC 
GCACCTCTGG 
TGACGGTGTC 
TACAGTCCTC 
GCACCCAGAC 
AAGTTGAGCC 
TCCTGGGGGG 
CCCGGACCCC 
AGTTCAACTG 
AGCAGTACAA 
TGAATGGCAA 
AAACCATCTC 
CCCGGGATGA
CCAGCGACAT 
CGCCTCCCGT 
AGAGCAGGTG 
ACCACTACAC 



CCACTCC CAG
GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGCCTACATG 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC
CTACATCTGC 
CAAATCTTGT 
ACCGTCAGTC 
TGAGGTCACA 
GTACGTGGAC 
CAGCACGTAC 
GGAGTACAAG 
CAAAGCCAAA 
GCTGACCAAG 
CGCCGTGGAG 
GCTGGACTCC 
GCAGCAGGGG 
GCAGAAGAGC 



Antibody DNase Fusions Made Here 
(eg pAS34 39. ) 



End of lower hinge region of heavy chain. PAPE Amino
Acid Seq. Fab'2 fusions were made at this point.

Those with HYBRID HINGES are altered further up



^ This part GAC AAA ACT CAC ACA

D K T H T



After this sequence you get the HYBRID HINGE + LINKER SEQUENCES 
Then DNAse I (eg Fab-DNase construct pAS302) 



Fig. 3(D) 
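
The hinge annotation above pairs the nucleotides GAC AAA ACT CAC ACA with the residues D K T H T. As a minimal sketch, assuming the standard genetic code, the pairing can be confirmed with a small translation helper; `translate` and `CODON_TABLE` are illustrative names, not part of the source.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


hinge = "GAC AAA ACT CAC ACA"
print(translate(hinge))  # DKTHT
```

The same helper can be pointed at any in-frame stretch of the records, for example to compare a construct's junction region with the translation printed beneath it in the codon figures.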
LOCUS PAS23.DNA 1554 bp mRNA PRI 06-MAR-1995

DEFINITION Humanised HMFG1 Fab'2 fused to human DNase I (construct 1)
ACCESSION
NID

KEYWORDS DNase I.

SOURCE DNase I sequence is from assembled oligos (thus modified c/f

MHDNASE1.dna)

ORGANISM Homo sapiens

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
AUTHORS Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L.
TITLE Recombinant human DNase I reduces the viscosity of cystic fibrosis

sputum

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990)

MEDLINE 91067672
BASE COUNT 344 a 468 c 434 g 308 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGA AG GGAGCGGCGG G CTGAAGATC 
781 GCAGCCTTCA ACATCCAGAC ATTTGGGGAG ACCAAGATGT CCAATGCCAC CCTCGTCAGC 
841 TACATTGTGC AGATCCTGAG CCGCTACGAC ATCGCCCTGG TCCAGGAGGT CAGAGACAGC 
901 CACCTGACTG CCGTGGGGAA GCTGCTGGAC AACCTCAATC AGGACGCACC AGACACCTAT 
961 CACTACGTGG TCAGTGAGCC ACTGGGACGG AACAGCTATA AGGAGCGCTA CCTGTTCGTG 
1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC AGCTACTACT ACGATGATGG CTGCGAGCCC 
1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA GCCATTGTCA GGTTCTTCTC CCGGTTCACA 
1141 GAGGTCAGGG AGTTTGCCAT TGTTCCCCTG CATGCGGCCC CGGGGGACGC AGTAGCCGAG 
1201 ATCGACGCTC TCTATGACGT CTACCTGGAT GTCCAAGAGA AATGGGGCTT GGAGGACGTC 
1261 ATGTTGATGG GCGACTTCAA TGCGGGCTGC AGCTATGTGA GACCCTCCCA GTGGTCATCC
1321 ATCCGCCTGT GGACAAGCCC CACCTTCCAG TGGCTGATCC CCGACAGCGC TGACACCACA
1381 GCTACACCCA CGCACTGTGC CTATGACAGG ATCGTGGTTG CAGGGATGCT GCTCCGAGGG 
1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT AACTTCCAGG CTGCCTATGG CCTGAGTGAC
1501 CAACTGGCCC AAGCCATCAG TGACCACTAT CCAGTGGAGG TGATGCTGAA GTGA 

// 



Fig. 5(A) 
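
Each record carries a BASE COUNT line that should sum to the length given on its LOCUS line. The following is a minimal sketch of that check for the PAS23.DNA record, together with a counter that tolerates the stray spaces and split line numbers seen in scanned ORIGIN blocks. The function name `count_bases` is illustrative, and any character other than A, C, G or T (including a misread base) is simply dropped, so a shortfall in the total is a hint that a line needs proofreading.

```python
import re
from collections import Counter


def count_bases(origin_lines):
    """Count A/C/G/T in ORIGIN lines, ignoring position numbers, spaces and other characters."""
    letters = re.sub(r"[^ACGT]", "", "".join(origin_lines).upper())
    return Counter(letters)


# Values copied from the PAS23.DNA record: BASE COUNT line and LOCUS length.
declared = {"A": 344, "C": 468, "G": 434, "T": 308}
declared_length = 1554
print(sum(declared.values()) == declared_length)  # True

# First ORIGIN line of the record, as printed (note the gap after the leader).
first_line = "1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG"
counts = count_bases([first_line])
print(sum(counts.values()))  # 60
```

Running `count_bases` over a complete, corrected ORIGIN block and comparing the result with `declared` is a quick way to spot residual scanning errors.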



LOCUS 

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 
frag 

frag 

frag 



FDDNASE23



1554 BP SS-DNA 



SYN 



25-AUG-2000 



344 A 



Location/Qualifiers

join(1..>720,<787..1554)

/note="1 to 1554 of 23.dna [Split]"

721..786

/note="1 to 66 of 23/27linker"

join(721..>735,<736..786)

/note="1 to 78 of 102linker [Split]"

466 C 435 G 309 T 0 OTHER



BASE COUNT 
ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 






61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT 

121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG 

181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC 

241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC

301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG 

361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC 

421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG 

481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC 

601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC 

661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC 

721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAGG GGAGCGGCGG

781 GCAGCCTTCA ACATCCAGAC ATTTGGGGAG

841 TACATTGTGC AGATCCTGAG CCGCTACGAC 

901 CACCTGACTG CCGTGGGGAA GCTGCTGGAC 

961 CACTACGTGG TCAGTGAGCC ACTGGGACGG 



GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGCCTACATG 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
GCTGAAGATC 



ACCAAGATGT 
ATCGCCCTGG 
AACCTCAATC 
AACAGCTATA 



CCAATGCCAC 
TCCAGGAGGT 
AGGACGCACC 
AGGAGCGCTA 



1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC AGCTACTACT ACGATGATGG

1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA GCCATTGTCA GGTTCTTCTC 

1141 GAGGTCAGGG AGTTTGCCAT TGTTCCCCTG CATGCGGCCC CGGGGGACGC 

1201 ATCGACGCTC TCTATGACGT CTACCTGGAT GTCCAAGAGA AATGGGGCTT 

1261 ATGTTGATGG GCGACTTCAA TGCGGGCTGC AGCTATGTGA GACCCTCCCA

1321 ATCCGGCTGT GGACAAGCCC CACCTTCCAG TGGCTGATCC CCGACAGCGC 

1381 GCTACACCCA CGCACTGTGC CTATGACAGG ATCGTGGTTG CAGGGATGCT 

1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT AACTTCCAGG CTGCCTATGG 

1501 CAACTGGCCC AAGCCATCAG TGACCACTAT CCAGTGGAGG TGATGCTGAA 



CCTCGTCAGC 
CAGAGACAGC 
AGACACCTAT 
CCTGTTCGTG 
CTGCGAGCCC 
CCGGTTCACA 
AGTAGCCGAG 
GGAGGACGTC 
GTGGTCATCC 
TGACACCACA 
GCTCCGAGGG 
CCTGAGTGAC 
GTGA 
Fig. 5(B)
LOCUS

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 
frag 

frag 

frag 

frag 

BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 



FDDNASE23K 



1563 BP SS-DNA 



SYN 



29-AUG-2000 



345 A 

GCCGCCACCA 
CACTCCCAGG 
AAGGTGTCCT 
CAGGCTCCAG 
AGATACAATG
GCCTACATGG 
TCCTACGACT 
GCCTCCACCA 
GGCACAGCGG 
TGGAACTCAG 
GGACTCTACT 
TACATCTGCA 
AAATCTTGTG 
CTGAAGATCG 



Location/Qualifiers
10..1563

/note="1 to 1554 of FdDNase23correct"
join(10..>729,<796..1563)
/note="1 to 1554 of 23.dna [Split]"
730..795

/note="1 to 66 of 23/27linker"

join(730..>744,<745..795)

/note="1 to 78 of 102linker [Split]"

472 C 437 G 309 T 0 OTHER



TGGGATGGAG 
TGCAGCTGGT 
GCAAGGCTTC 
GAAAGGGCCT 
AGAAGTTCAA 
AGCTCAGCAG 
TTGCCTGGTT 
AGGGCCCATC 
CCCTGGGCTG 
GCGCCCTGAC 
CCCTCAGCAG 
ACGTGAATCA 
ACAAAACTCA 
CAGCCTTCAA 
ACATTGTGCA 



CTGTATCATC 
GCAGTCTGGG 
TGGCTACACC 
CGAGTGGGTC 
GGGCCGAGTG 
CCTGAGGTCT 
TGCTTACTGG 
GGTCTTCCCC 
CCTGGTCAAG 
CAGCGGCGTG 
CGTGGTGACC 
CAAGCCCAGC 
CACATGTCCA 
CATCCAGACA 
GATCCTGAGC 



CTCTTCTTGG 
GCAGAGGTGA 
TTCAGTGCCT 
GGAGAGATTT 
ACAGTCACTA 
GAGGACACAG
GGCCAAGGGA 
CTGGCACCCT 
GACTACTTCC 
CACACCTTCC 
GTGCCCTCCA 
AACACCAAGG 
CCGTGTCCAG 
TTTGGGGAGA 
CGCTACGACA 



TAGCAACAGC 
AAAAGCCTGG 
ACTGGATAGA
TACCTGGAAG
GAGACACATC
CCGTCTATTA 
CTCTGGTCAC 
CCTCCAAGAG 
CCGAACCGGT 
CGGCTGTCCT 
GCAGCTTGGG 
TGGACAAGAA 
CACCAGAGGG
CCAAGATGTC 
TCGCCCTGGT 



CTCGTCAGCT 

AGAGACAGCC ACCTGACTGC CGTGGGGAAG CTGCTGGACA ACCTCAATCA 
GACACCTATC ACTACGTGGT CAGTGAGCCA CTGGGACGGA ACAGCTATAA 
1021 CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT GCGGTGGACA GCTACTACTA 
1081 TGCGAGCCCT GCGGGAACGA CACCTTCAAC CGAGAGCCAG CCATTGTCAG 
1141 CGGTTCACAG AGGTCAGGGA GTTTGCCATT GTTCCCCTGC ATGCGGCCCC 
1201 GTAGCCGAGA TCGACGCTCT CTATGACGTC TACCTGGATG TCCAAGAGAA 
1261 GAGGACGTCA TGTTGATGGG CGACTTCAAT GCGGGCTGCA GCTATGTGAG
1321 TGGTCATCCA TCCGCCTGTG GACAAGCCCC ACCTTCCAGT GGCTGATCCC
1381 GACACCACAG CTACACCCAC GCACTGTGCC TATGACAGGA TCGTGGTTGC
1441 CTCCGAGGGG CCGTTGTTCC CGACTCGGCT CTTCCCTTTA ACTTCCAGGC
1501 CTGAGTGACC AACTGGCCCA AGCCATCAGT GACCACTATC CAGTGGAGGT 
1561 TGA 



TACAGGTGTC 
GGCCTCAGTG 
GTGGGTGCGC 
TAATAATTCT 
CACAAACACA 
CTGTGCAAGA 
AGTCTCCTCA 
CACCTCTGGG 
GACGGTGTCG 
ACAGTCCTCA 
CACCCAGACC 
AGTTGAGCCC 
GAGCGGCGGG 
CAATGCCACC 
CCAGGAGGTC 
GGACGCACCA 
GGAGCGCTAC 
CGATGATGGC 
GTTCTTCTCC 
GGGGGACGCA 
ATGGGGCTTG 
ACCCTCCCAG 
CGACAGCGCT 
AGGGATGCTG 
TGCCTATGGC 
GATGCTGAAG 



// 



Fig. 5(C) 



9 18 27 36 45 54

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC



MGWSCIILFLVATATGVH

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA



SQVQ LVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VKVSCKASGYTFSAYWIE

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAPGKGLEWVGEILP

225 234 243 252 261 270

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT



GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG



RDTSTNTAYMELSSLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WGQGTLVTVSSASTKG PS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLQS SG 

Fig. 5(D) 

(Sheet 1 of 3) 



603 612 621 630 639 648

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG

LYSLSSVVTVPSSSLGTQ

657 666 675 684 693 702

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA

TYICNVNHKPSNTKVDKK



711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT



VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG 

EGSGGLKIAAFNIQTFGE

819 828 837 846 855 864

ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC



TKMSNATLVSYIVQILSR 

873 882 891 900 909 918 

TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG



YDIALVQEVRDSHLTAVG 

927 936 945 954 963 972 

AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC 



KLLDNLNQ DA PDTY HYVV 

981 990 999 1008 1017 1026 

AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG 



SEPLGRNSYKERYLFVYR 

1035 1044 1053 1062 1071 1080 

CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC 



PDQVSAVDSYYYDDGCEP 

1089 1098 1107 1116 1125 1134 

TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG



CGNDTFNREPAIVRFFSR 

1143 1152 1161 1170 1179 1188 

TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC 



FTEVREFAIVPLHAAPGD
1197 1206 1215 1224 1233 1242 

Fig. 5(D) 



GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA 



AVAEIDALYDVYLDVQEK

1251 1260 1269 1278 1287 1296 

TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT 



WGLEDVMLMGDFNAGC S Y 

1305 1314 1323 1332 1341 1350 

GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG 



VRPSQWSSIRLWTSPTFQ 

1359 1368 1377 1386 1395 1404 

TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT 



WLI PDS ADTTATPTHCAY 

1413 1422 1431 1440 1449 1458 

GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG 



DRIVVAGMLLRGAVVP DS 

1467 1476 1485 1494 1503 1512 

GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA 



ALPFNFQAAYGLSDQLAQ 

1521 1530 1539 1548 

GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 



AISDHYPVEVMLK



Fig. 5(D) 
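
The codon figure above prints 18 codons (54 bases) per row and places a position label over every ninth base. A minimal sketch of that layout rule follows, which can help when re-aligning rulers that were scrambled in scanning. The function name `ruler` and its parameters are illustrative, and the row width of 18 codons is inferred from the figure rather than stated in it.

```python
def ruler(row, seq_length, codons_per_row=18, label_every=9):
    """Position labels printed above one row of the codon figure (rows are 0-based)."""
    row_bases = codons_per_row * 3
    first = row * row_bases + 1
    last = min(first + row_bases - 1, seq_length)
    return [p for p in range(first, last + 1) if p % label_every == 0]


print(ruler(0, 1554))   # [9, 18, 27, 36, 45, 54]
print(ruler(1, 1554))   # [63, 72, 81, 90, 99, 108]
print(ruler(28, 1554))  # [1521, 1530, 1539, 1548]
```

The final row of the 1554-base construct holds only 14 codons, which is why its ruler stops at 1548.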
LOCUS PAS27.DNA 1584 bp mRNA PRI 06-MAR-1995

DEFINITION Humanised HMFG1 Fab'2 fused to human DNase I with SV40

NLS (construct 1)

ACCESSION

NID

KEYWORDS DNase I.

SOURCE DNase I sequence is from assembled oligos (thus modified c/f

MHDNASE1.dna)

ORGANISM Homo sapiens

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
AUTHORS Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L.
TITLE Recombinant human DNase I reduces the viscosity of cystic fibrosis

sputum

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990)

MEDLINE 91067672
BASE COUNT 354 a 474 c 446 g 310 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGA AG GGAGCGGCGG G CTGAAGATC
781 GCAGCCTTCA ACATCCAGAC ATTTGGGGAG ACCAAGATGT CCAATGCCAC CCTCGTCAGC
841 TACATTGTGC AGATCCTGAG CCGCTACGAC ATCGCCCTGG TCCAGGAGGT CAGAGACAGC
901 CACCTGACTG CCGTGGGGAA GCTGCTGGAC AACCTCAATC AGGACGCACC AGACACCTAT 
961 CACTACGTGG TCAGTGAGCC ACTGGGACGG AACAGCTATA AGGAGCGCTA CCTGTTCGTG 
1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC AGCTACTACT ACGATGATGG CTGCGAGCCC
1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA GCCATTGTCA GGTTCTTCTC CCGGTTCACA 
1141 GAGGTCAGGG AGTTTGCCAT TGTTCCCCTG CATGCGGCCC CGGGGGACGC AGTAGCCGAG 
1201 ATCGACGCTC TCTATGACGT CTACCTGGAT GTCCAAGAGA AATGGGGCTT GGAGGACGTC 
1261 ATGTTGATGG GCGACTTCAA TGCGGGCTGC AGCTATGTGA GACCCTCCCA GTGGTCATCC
1321 ATCCGCCTGT GGACAAGCCC CACCTTCCAG TGGCTGATCC CCGACAGCGC TGACACCACA 
1381 GCTACACCCA CGCACTGTGC CTATGACAGG ATCGTGGTTG CAGGGATGCT GCTCCGAGGG 
1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT AACTTCCAGG CTGCCTATGG CCTGAGTGAC
1501 CAACTGGCCC AAGCCATCAG TGACCACTAT CCAGTGGAGG TGATGCTGAA GGGGGGCGGA
1561 CCCAAAAAGA AGCGCAAGGT TTGA 
Fig. 6(A)
LOCUS

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES 
frag 

frag 

frag 

BASE COUNT 
ORIGIN 

1 



FDDNASE27



1584 BP SS-DNA 



SYN 



25-AUG-2000 



354 A 



Location/Qualifiers

join(1..>720,<787..1584)

/note="1 to 1584 of 27.dna [Split]"

721..786

/note="1 to 66 of 23/27linker"

join(721..>735,<736..786)

/note="1 to 78 of 102linker [Split]"

472 C 447 G 311 T 0 OTHER



ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT 

61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT 

121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG 

181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC 

241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC 

301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG 

361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC 

421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG 

481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC 

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC 

601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC 

661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC

721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAGG GGAGCGGCGG 

781 GCAGCCTTCA ACATCCAGAC ATTTGGGGAG ACCAAGATGT CCAATGCCAC 

841 TACATTGTGC AGATCCTGAG CCGCTACGAC ATCGCCCTGG TCCAGGAGGT 

901 CACCTGACTG CCGTGGGGAA GCTGCTGGAC AACCTCAATC AGGACGCACC 

961 CACTACGTGG TCAGTGAGCC ACTGGGACGG AACAGCTATA AGGAGCGCTA 

1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC AGCTACTACT ACGATGATGG

1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA GCCATTGTCA GGTTCTTCTC 

1141 GAGGTCAGGG AGTTTGCCAT TGTTCCCCTG CATGCGGCCC CGGGGGACGC 

1201 ATCGACGCTC TCTATGACGT CTACCTGGAT GTCCAAGAGA AATGGGGCTT 

1261 ATGTTGATGG GCGACTTCAA TGCGGGCTGC AGCTATGTGA GACCCTCCCA

1321 ATCCGCCTGT GGACAAGCCC CACCTTCCAG TGGCTGATCC CCGACAGCGC 

1381 GCTACACCCA CGCACTGTGC CTATGACAGG ATCGTGGTTG CAGGGATGCT 

1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT AACTTCCAGG CTGCCTATGG

1501 CAACTGGCCC AAGCCATCAG TGACCACTAT CCAGTGGAGG TGATGCTGAA 

1561 CCCAAAAAGA AGCGCAAGGT TTGA 



CCACTCCCAG 
GAAGGTGTCC 
CCAGGCTCCA 
TAGATACAAT 
AGCCTACATG 
ATCCTACGAC 
AGCCTCCACC 
GGGCACAGCG 
GTGGAACTCA 
AGGACTCTAC 
CTACATCTGC 
CAAATCTTGT 
GCTGAAGATC 
CCTCGTCAGC 
CAGAGACAGC 
AGACACCTAT 
CCTGTTCGTG 
CTGCGAGCCC 
CCGGTTCACA 
AGTAGCCGAG 
GGAGGACGTC 
GTGGTCATCC 
TGACACCACA 
GCTCCGAGGG 
CCTGAGTGAC 
GGGGGGCGGA 
LOCUS FDDNASE27K 1593 BP SS-DNA SYN 29-AUG-2000

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers
frag 10..1593

/note="1 to 1584 of FdDNase27correct"
frag join(10..>729,<796..1593)

/note="1 to 1584 of 27.dna [Split]"
frag 730..795

/note="1 to 66 of 23/27linker"
frag join(730..>744,<745..795)

/note="1 to 78 of 102linker [Split]"
BASE COUNT 355 A 478 C 449 G 311 T 0 OTHER

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGTCCA CCGTGTCCAG CACCAGAGGG GAGCGGCGGG 
781 CTGAAGATCG CAGCCTTCAA CATCCAGACA TTTGGGGAGA CCAAGATGTC CAATGCCACC 
841 CTCGTCAGCT ACATTGTGCA GATCCTGAGC CGCTACGACA TCGCCCTGGT CCAGGAGGTC 
901 AGAGACAGCC ACCTGACTGC CGTGGGGAAG CTGCTGGACA ACCTCAATCA GGACGCACCA 
961 GACACCTATC ACTACGTGGT CAGTGAGCCA CTGGGACGGA ACAGCTATAA GGAGCGCTAC 
1021 CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT GCGGTGGACA GCTACTACTA CGATGATGGC 
1081 TGCGAGCCCT GCGGGAACGA CACCTTCAAC CGAGAGCCAG CCATTGTCAG GTTCTTCTCC 
1141 CGGTTCACAG AGGTCAGGGA GTTTGCCATT GTTCCCCTGC ATGCGGCCCC GGGGGACGCA 
1201 GTAGCCGAGA TCGACGCTCT CTATGACGTC TACCTGGATG TCCAAGAGAA ATGGGGCTTG 
1261 GAGGACGTCA TGTTGATGGG CGACTTCAAT GCGGGCTGCA GCTATGTGAG ACCCTCCCAG
1321 TGGTCATCCA TCCGCCTGTG GACAAGCCCC ACCTTCCAGT GGCTGATCCC CGACAGCGCT 
1381 GACACCACAG CTACACCCAC GCACTGTGCC TATGACAGGA TCGTGGTTGC AGGGATGCTG 
1441 CTCCGAGGGG CCGTTGTTCC CGACTCGGCT CTTCCCTTTA ACTTCCAGGC TGCCTATGGC 
1501 CTGAGTGACC AACTGGCCCA AGCCATCAGT GACCACTATC CAGTGGAGGT GATGCTGAAG 
1561 GGGGGCGGAC CCAAAAAGAA GCGCAAGGTT TGA 
Fig. 6(C)



9 18 27 36 45 54

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC



MGWSCIILFLVATATGVH

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQLVQSGAEVKKPGAS

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VKVSCKASGYTFSAYWIE

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAPGKGLEWVGEILP

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT



GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG



RDTSTNTAYMELSSLRSE

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WGQGTLVTVSSASTKGPS

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAPSSKSTSGGTAAL

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA



GCLVKDYFPEPVTVSWNS

549 558 567 576 585 594

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLQSSG 
603 612 621 630 639 648

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG

LYSLSSVVTVPSSSLGTQ



657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKVDKK



711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT



VEPKSCDKTHTCPPCPAP



765 774 783 792 801 810 

GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG 



EGSGGLKIAAFNIQTFGE



819 828 837 846 855 864 

ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC 



TKMSNATLVSYIVQILSR



873 882 891 900 909 918 

TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG



YDIALVQEVRDSHLTAVG



927 936 945 954 963 972 

AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC 



KLLDNLNQDAPDTYHYVV



981 990 999 1008 1017 1026 

AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG 



SEPLGRNSYKERYLFVYR



1035 1044 1053 1062 1071 1080 

CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC



PDQVSAVDSYYYDDGCEP



1089 1098 1107 1116 1125 1134 

TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG 



CGNDTFNREPAIVRFFSR



1143 1152 1161 1170 1179 1188

TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC

FTEVREFAIVPLHAAPGD



1197 1206 1215 1224 1233 1242

GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA 



AVAEIDALYDVYLDVQEK

1251 1260 1269 1278 1287 1296 

TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT 



WGLEDVMLMGDFNAGCSY 

1305 1314 1323 1332 1341 1350 

GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG



VRPSQWSSIRLWTSPTFQ 

1359 1368 1377 1386 1395 1404 

TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT



WLIPDSADTTATPTHCAY 

1413 1422 1431 1440 1449 1458

GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG

DRIVVAGMLLRGAVVPDS

1467 1476 1485 1494 1503 1512

GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA



ALPFNFQAAYGLSDQLAQ 

1521 1530 1539 1548 1557 1566 

GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA 



AISDHYPVEVMLKGGGPK 



1575 1584 
AAG AAG CGC AAG GTT TGA 3 ' 



K K R K V 
PAS34



LOCUS PAS34.DNA 2196 bp 2196 bp 2196 bp DNA 14-AUG-1998 

DEFINITION HUMANISED HMFG1 heavy chain fused to human DNAse construct 34

DEFINITION Clone 16.4.2 (same as hcdnasel.dna template file) 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna)

COMMENT The fusion was made using overlapping oligos AS79 and AS80 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

SITES Note 
BASE COUNT 501 a 677 c 607 g 411 t 

ORIGIN ? 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC
1381 CTCTCCCTGT CTCCGGGTAA A GGGAGCGGC GGG CTGAAGA TCGCAGCCTT CAACATCCAG 
1441 ACATTTGGGG AGACCAAGAT GTCCAATGCC ACCCTCGTCA GCTACATTGT GCAGATCCTG
1501 AGCCGCTACG ACATCGCCCT GGTCCAGGAG GTCAGAGACA GCCACCTGAC TGCCGTGGGG 
1561 AAGCTGCTGG ACAACCTCAA TCAGGACGCA CCAGACACCT ATCACTACGT GGTCAGTGAG 
1621 CCACTGGGAC GGAACAGCTA TAAGGAGCGC TACCTGTTCG TGTACAGGCC TGACCAGGTG 
1681 TCTGCGGTGG ACAGCTACTA CTACGATGAT GGCTGCGAGC CCTGCGGGAA CGACACCTTC
1741 AACCGAGAGC CAGCCATTGT CAGGTTCTTC TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC
1801 ATTGTTCCCC TGCATGCGGC CCCGGGGGAC GCAGTAGCCG AGATCGACGC TCTCTATGAC
1861 GTCTACCTGG ATGTCCAAGA GAAATGGGGC TTGGAGGACG TCATGTTGAT GGGCGACTTC
1921 AATGCGGGCT GCAGCTATGT GAGACCCTCC CAGTGGTCAT CCATCCGCCT GTGGACAAGC 
1981 CCCACCTTCC AGTGGCTGAT CCCCGACAGC GCTGACACCA CAGCTACACC CACGCACTGT 
2041 GCCTATGACA GGATCGTGGT TGCAGGGATG CTGCTCCGAG GGGCCGTTGT TCCCGACTCG
2101 GCTCTTCCCT TTAACTTCCA GGCTGCCTAT GGCCTGAGTG ACCAACTGGC CCAAGCCATC 
2161 AGTGACCACT ATCCAGTGGA GGTGATGCTG AAGTGA 
279 288 297 306 315 324

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG

RDTSTNTAYMELSSLRSE

333 342 351 360 369 378

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC

DTAVYYCARSYDFAWFAY

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WGQGTLVTVSSASTKGPS

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAPSSKSTSGGTAAL

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLQSSG

603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ 
657 666 675 684 693 702 
ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA

TYICNVNHKPSNTKVDKK

711 720 729 738 747 756

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT



VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 



ELLGGPSVFLFPPKPKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 



LMISRTPEVTCVVVDVSH

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 



EDPEVKFNWYVDGVEVHN

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 



AKTKPREEQYNSTYRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 



VLTVLHQDWLNGKEYKCK

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 



VSNKALPAPIEKTISKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 



GQPREPQVYTLPPSRDEL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 



TKNQVSLTCLVKGFYPSD

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 



IAVEWESNGQPENNYKTT

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 



PPVLDSDGSFFLYSKLTV

1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 



DKSRWQQGNVFSCSVMHE
1359 1368 1377 1386 1395
PAS35



LOCUS PAS35.DNA 2193 bp 2193 bp DNA 14-AUG-1998 

DEFINITION HUMANISED HMFG1 heavy chain fused to human DNAse construct 35 

DEFINITION Clone 17.12.1 with silent K to K mutation (1398 A > G) 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase . dna) 

COMMENT The fusion was made using overlapping oligos AS81 and AS82 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A)

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

FEATURES In 17.12.1 residue 1398 is A > G (silent K to K mutation) 

SITES Note

BASE COUNT 500 a 677 c 606 g 410 t

ORIGIN ? 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC
1381 CTCTCCCTGT CTCCGAAa GG GAGCGGCGGG CTGAAGATCG CAGCCTTCAA CATCCAGACA
1441 TTTGGGGAGA CCAAGATGTC CAATGCCACC CTCGTCAGCT ACATTGTGCA GATCCTGAGC
1501 CGCTACGACA TCGCCCTGGT CCAGGAGGTC AGAGACAGCC ACCTGACTGC CGTGGGGAAG 
1561 CTGCTGGACA ACCTCAATCA GGACGCACCA GACACCTATC ACTACGTGGT CAGTGAGCCA 
1621 CTGGGACGGA ACAGCTATAA GGAGCGCTAC CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT 
1681 GCGGTGGACA GCTACTACTA CGATGATGGC TGCGAGCCCT GCGGGAACGA CACCTTCAAC 

1741 CGAGAGCCAG CCATTGTCAG GTTCTTCTCC CGGTTCACAG AGGTCAGGGA GTTTGCCATT
1801 GTTCCCCTGC ATGCGGCCCC GGGGGACGCA GTAGCCGAGA TCGACGCTCT CTATGACGTC
1861 TACCTGGATG TCCAAGAGAA ATGGGGCTTG GAGGACGTCA TGTTGATGGG CGACTTCAAT
1921 GCGGGCTGCA GCTATGTGAG ACCCTCCCAG TGGTCATCCA TCCGCCTGTG GACAAGCCCC 
1981 ACCTTCCAGT GGCTGATCCC CGACAGCGCT GACACCACAG CTACACCCAC GCACTGTGCC
2041 TATGACAGGA TCGTGGTTGC AGGGATGCTG CTCCGAGGGG CCGTTGTTCC CGACTCGGCT 
2101 CTTCCCTTTA ACTTCCAGGC TGCCTATGGC CTGAGTGACC AACTGGCCCA AGCCATCAGT 
2161 GACCACTATC CAGTGGAGGT GATGCTGAAG TGA 
pAS36



LOCUS PAS36.DNA 2190 bp 2190 bp DNA 14-AUG-1998

DEFINITION HUMANISED HMFG1 heavy chain fused to human DNAse construct 36

DEFINITION Clone 18.24.1 with residue 1392 T > C 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna)

COMMENT The fusion was made using overlapping oligos AS83 and AS84 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

FEATURES Residue 1392 T > C silent S to S mutation 

SITES Note 
BASE COUNT 498 a 678 c 605 g 409 t 

ORIGIN ? 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CcCCG GGGAG CGGCGGG CTG AAGATCGCAG CCTTCAACAT CCAGACATTT
1441 GGGGAGACCA AGATGTCCAA TGCCACCCTC GTCAGCTACA TTGTGCAGAT CCTGAGCCGC
1501 TACGACATCG CCCTGGTCCA GGAGGTCAGA GACAGCCACC TGACTGCCGT GGGGAAGCTG
1561 CTGGACAACC TCAATCAGGA CGCACCAGAC ACCTATCACT ACGTGGTCAG TGAGCCACTG
1621 GGACGGAACA GCTATAAGGA GCGCTACCTG TTCGTGTACA GGCCTGACCA GGTGTCTGCG
1681 GTGGACAGCT ACTACTACGA TGATGGCTGC GAGCCCTGCG GGAACGACAC CTTCAACCGA
1741 GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG TTCACAGAGG TCAGGGAGTT TGCCATTGTT
1801 CCCCTGCATG CGGCCCCGGG GGACGCAGTA GCCGAGATCG ACGCTCTCTA TGACGTCTAC
1861 CTGGATGTCC AAGAGAAATG GGGCTTGGAG GACGTCATGT TGATGGGCGA CTTCAATGCG
1921 GGCTGCAGCT ATGTGAGACC CTCCCAGTGG TCATCCATCC GCCTGTGGAC AAGCCCCACC 
1981 TTCCAGTGGC TGATCCCCGA CAGCGCTGAC ACCACAGCTA CACCCACGCA CTGTGCCTAT 
2041 GACAGGATCG TGGTTGCAGG GATGCTGCTC CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT 
2101 CCCTTTAACT TCCAGGCTGC CTATGGCCTG AGTGACCAAC TGGCCCAAGC CATCAGTGAC 
2161 CACTATCCAG TGGAGGTGAT GCTGAAGTGA 
VKVSCKASGYTFSAYWIE

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAPGKGLEWVGEILP

225 234 243 252 261 270

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT

GSNNSRYNEKFKGRVTVT

279 288 297 306 315 324

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG

RDTSTNTAYMELSSLRSE

333 342 351 360 369 378

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC

DTAVYYCARSYDFAWFAY

387 396 405 414 423 432

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG

WGQGTLVTVSSASTKGPS

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAPSSKSTSGGTAAL

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLQSSG 

603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ
657 666 675 684 693 702
(Sheet 1 of 4)



ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA

TYICNVNHKPSNTKVDKK

711 720 729 738 747 756

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT



VEPKSCDKTHTCPPCPAP

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 



ELLGGPSVFLFPPKPKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 



LMISRTPEVTCVVVDVSH

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 



EDPEVKFNWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 



AKTKPREEQYNSTYRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 



VLTVLHQDWLNGKEYKCK

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA



VSNKALPAPIEKTISKAK

1089 1098 1107 1116 1125 1134

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 



GQPREPQVYTLPPSRDEL

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 



TKNQVSLTCLVKGFYPSD

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 



IAVEWESNGQPENNYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 



PPVLDSDGSFFLYSKLTV 

1305 1314 1323 1332 1341 1350

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG



DKSRWQQGNVFSCSVMHE
1359 1368 1377 1386 1395 1404

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCC CCG GGG AGC GGC 



ALHNHYTQKSLSLSPGSG

_______ , 

1413 1422 1431 1440 1449 1458

GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG TCC



GLKIAAFNIQTFGETKMS 



1467 1476 1485 1494 1503 1512 

AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC GCC 



NATLVSYIVQILSRYDIA

1521 1530 1539 1548 1557 1566 

CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG GAC



LVQEVRDSHLTAVGKLLD 

1575 1584 1593 1602 1611 1620 

AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA CTG 



NLNQDAPDTYHYVVSEPL

1629 1638 1647 1656 1665 1674 

GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG GTG 



GRNSYKERYLFVYRPDQV 

1683 1692 1701 1710 1719 1728

TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC GAC



SAVDSYYYDDGCEPCGND 

1737 1746 1755 1764 1773 1782 

ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG GTC 



TFNREPAIVRFFSRFTEV

1791 1800 1809 1818 1827 1836 

AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC GAG 



REFAIVPLHAAPGDAVAE 

1845 1854 1863 1872 1881 1890 

ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG GAG 



IDALYDVYLDVQEKWGLE

1899 1908 1917 1926 1935 1944 

GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC TCC



DVMLMGDFNAGCSYVRPS 

1953 1962 1971 1980 1989 1998 

CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC CCC 



QWSSIRLWTSPTFQWLIP

2007 2016 2025 2034 2043 2052 

GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC GTG 



DSADTTATPTHCAYDRIV
2061 2070 2079 2088 2097 2106

GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC TTT



VAGMLLRGAVVPDSALPF

2115 2124 2133 2142 2151 2160 

AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT GAC 



NFQAAYGLSDQLAQAISD

2169 2178 2187 

CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 



HYPVEVMLK 
LOCUS PAS37.DNA 2226 bp 2196 bp 2196 bp DNA 14-AUG-1998

DEFINITION HUMANISED HMFG1 heavy chain fused to human DNAse construct 37 

DEFINITION Clone 16.4.2 (same as hcdnasel.dna template file) plus NLS
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) :78, 364-370

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna)

COMMENT The fusion was made using overlapping oligos AS79 and AS80

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

SITES Note 
BASE COUNT 511 a 683 c 619 g 413 t 

ORIGIN ? 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA

181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC
1381 CTCTCCCTGT CTCCGGGTAA A GGGAGCGGC GGG CTGAAGA TCGCAGCCTT CAACATCCAG 

1441 ACATTTGGGG AGACCAAGAT GTCCAATGCC ACCCTCGTCA GCTACATTGT GCAGATCCTG
1501 AGCCGCTACG ACATCGCCCT GGTCCAGGAG GTCAGAGACA GCCACCTGAC TGCCGTGGGG
1561 AAGCTGCTGG ACAACCTCAA TCAGGACGCA CCAGACACCT ATCACTACGT GGTCAGTGAG
1621 CCACTGGGAC GGAACAGCTA TAAGGAGCGC TACCTGTTCG TGTACAGGCC TGACCAGGTG
1681 TCTGCGGTGG ACAGCTACTA CTACGATGAT GGCTGCGAGC CCTGCGGGAA CGACACCTTC
1741 AACCGAGAGC CAGCCATTGT CAGGTTCTTC TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC
1801 ATTGTTCCCC TGCATGCGGC CCCGGGGGAC GCAGTAGCCG AGATCGACGC TCTCTATGAC
1861 GTCTACCTGG ATGTCCAAGA GAAATGGGGC TTGGAGGACG TCATGTTGAT GGGCGACTTC
1921 AATGCGGGCT GCAGCTATGT GAGACCCTCC CAGTGGTCAT CCATCCGCCT GTGGACAAGC 
1981 CCCACCTTCC AGTGGCTGAT CCCCGACAGC GCTGACACCA CAGCTACACC CACGCACTGT 
2041 GCCTATGACA GGATCGTGGT TGCAGGGATG CTGCTCCGAG GGGCCGTTGT TCCCGACTCG 
2101 GCTCTTCCCT TTAACTTCCA GGCTGCCTAT GGCCTGAGTG ACCAACTGGC CCAAGCCATC 
2161 AGTGACCACT ATCCAGTGGA GGTGATGCTG AA GGGGGGCG GACCCAAAAA GAAGCGCAAG
2221 GTTTGA



Fig. 10(A) 



>NLS 



9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC



MGWSCIILFLVATATGVH

63 72 81 90 99 108

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA



SQVQLVQSGAEVKKPGAS



117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VKVSCKASGYTFSAYWIE 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAPGKGLEWVGEILP

225 234 243 252 261 270

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



GSNNSRYNEKFKGRVTVT

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTNTAYMELSSLRSE

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WGQGTLVTVSSASTKGPS

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLQSSG

603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ
657 666 675 684 693 702

Fig. 10(B)
ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA



TYICNVNHKPSNTKVDKK

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT



VEPKSCDKTHTCPPCPAP

765 774 783 792 801 810

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 



ELLGGPSVFLFPPKPKDT

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 



LMISRTPEVTCVVVDVSH

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 



EDPEVKFNWYVDGVEVHN

927 936 945 954 963 972

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC



AKTKPREEQYNSTYRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 



VLTVLHQDWLNGKEYKCK

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 



VSNKALPAPIEKTISKAK

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 



GQPREPQVYTLPPSRDEL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 



TKNQVSLTCLVKGFYPSD

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 



IAVEWESNGQPENNYKTT

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 



PPVLDSDGSFFLYSKLTV

1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 



DKSRWQQGNVFSCSVMHE
1359 1368 1377 1386 1395 1404

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCT CCG GGT AAA GGG

ALHNHYTQKSLSLSPGKG

1413 1422 1431 1440 1449 1458 

AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG



SGGLKIAAFNIQTFGETK



1467 1476 1485 1494 1503 1512 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 



MSNATLVSYIVQILSRYD

1521 1530 1539 1548 1557 1566 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG



IALVQEVRDSHLTAVGKL 

1575 1584 1593 1602 1611 1620 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 



LDNLNQDAPDTYHYVVSE

1629 1638 1647 1656 1665 1674 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 



PLGRNSYKERYLFVYRPD 

1683 1692 1701 1710 1719 1728 

CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG 



QVSAVDSYYYDDGCEPCG 

1737 1746 1755 1764 1773 1782 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA



NDTFNREPAIVRFFSRFT 

1791 1800 1809 1818 1827 1836 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 



EVREFAIVPLHAAPGDAV

1845 1854 1863 1872 1881 1890

GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 



AEIDALYDVYLDVQEKWG 

1899 1908 1917 1926 1935 1944

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA 



LEDVMLMGDFNAGCSYVR 

1953 1962 1971 1980 1989 1998 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG



PSQWSSIRLWTSPTFQWL

2007 2016 2025 2034 2043 2052 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG



IPDSADTTATPTHCAYDR



2061 2070 2079 2088 2097 2106 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 

IVVAGMLLRGAVVPDSAL

2115 2124 2133 2142 2151 2160 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 

PFNFQAAYGLSDQLAQAI

2169 2178 2187 2196 2205 2214 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA AAG AAG 

SDHY PVEVMLK G G G P K K K 
2223 

CGC AAG GTT TGA 3 ' 
R K V 
PAS38 



LOCUS PAS38.DNA 2223 bp 2193 bp DNA 14-AUG-1998 

DEFINITION HUMANISED HMFG1 heavy chain, fused to human DNAse construct 38 

DEFINITION Clone 17.12.1 with silent K to K mutation (1398 A > G) +NLS 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase . dna) 

COMMENT The fusion was made using overlapping oligos AS81 and AS82 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

FEATURES In 17.12.1 residue 1398 is A > G (silent K to K mutation) 

SITES Note 
BASE COUNT 510 a 683 c 618 g 412 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 

61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 

121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 

181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 

241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 

301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 

361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 

421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 

481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 

601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 

661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 

721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 

781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 

841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 

901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 

961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 

1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 

1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 

1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 

1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 

1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CTCCGAAaGG GAGCGGCGGG CTGAAGATCG CAGCCTTCAA CATCCAGACA 
1441 TTTGGGGAGA CCAAGATGTC CAATGCCACC CTCGTCAGCT ACATTGTGCA GATCCTGAGC 
1501 CGCTACGACA TCGCCCTGGT CCAGGAGGTC AGAGACAGCC ACCTGACTGC CGTGGGGAAG 
1561 CTGCTGGACA ACCTCAATCA GGACGCACCA GACACCTATC ACTACGTGGT CAGTGAGCCA 
1621 CTGGGACGGA ACAGCTATAA GGAGCGCTAC CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT 
1681 GCGGTGGACA GCTACTACTA CGATGATGGC TGCGAGCCCT GCGGGAACGA CACCTTCAAC 
1741 CGAGAGCCAG CCATTGTCAG GTTCTTCTCC CGGTTCACAG AGGTCAGGGA GTTTGCCATT 
1801 GTTCCCCTGC ATGCGGCCCC GGGGGACGCA GTAGCCGAGA TCGACGCTCT CTATGACGTC 
1861 TACCTGGATG TCCAAGAGAA ATGGGGCTTG GAGGACGTCA TGTTGATGGG CGACTTCAAT 
1921 GCGGGCTGCA GCTATGTGAG ACCCTCCCAG TGGTCATCCA TCCGCCTGTG GACAAGCCCC 
1981 ACCTTCCAGT GGCTGATCCC CGACAGCGCT GACACCACAG CTACACCCAC GCACTGTGCC 
2041 TATGACAGGA TCGTGGTTGC AGGGATGCTG CTCCGAGGGG CCGTTGTTCC CGACTCGGCT 
2101 CTTCCCTTTA ACTTCCAGGC TGCCTATGGC CTGAGTGACC AACTGGCCCA AGCCATCAGT 
2161 GACCACTATC CAGTGGAGGT GATGCTGAAG GGGGGCGGAC CCAAAAAGAA GCGCAAGGTT 
2221 TGA 
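
The record above describes clone 17.12.1 as carrying a silent K to K change at residue 1398 (A > G), and the companion construct 39 record describes a silent S to S change at residue 1392 (T > C). The following is a minimal sketch, assuming the standard genetic code and an in-frame reading from position 1, of how such a claim can be checked. The names `CODON_TABLE` and `codon_effect` are illustrative only and do not come from the listing.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G
# (third position varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codon_effect(ref_codon, alt_codon):
    """Return (reference residue, altered residue, True if the change is silent)."""
    ref_aa = CODON_TABLE[ref_codon.upper()]
    alt_aa = CODON_TABLE[alt_codon.upper()]
    return ref_aa, alt_aa, ref_aa == alt_aa


# Codon 1396-1398 (AAA -> AAG) and codon 1390-1392 (TCT -> TCC).
print(codon_effect("AAA", "AAG"))
print(codon_effect("TCT", "TCC"))
```

Both substitutions fall on the third base of their codon, which is why the encoded residue is unchanged.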
117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSCKASGYTFSAYWI E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVGEILP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNTAYMELSS LRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WGQGTLVTVSSASTKG PS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAVLQSSG 

603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSLSSVVTVPSSSLGTQ 

657 666 675 684 693 702 
ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



VEPKSCDKTHTCPPC PAP 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 



ELLGGPSVFLFPPKPKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 



LMISRTPEVTCVVVDVSH 

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 



EDPEVKFNWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 



AKTKPREEQYNSTYRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 



VLTVLHQDWLNGKEYKCK 

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 



VSNKALPAPI EKTISKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 



GQPREPQVYTLPPSRDEL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 



TKNQVSLTCLVKGFYPSD 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 



IAVEWESNGQ PENNYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 



PPVLDSDGSFFLYSKLTV 

1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 



DKSRWQQGNVFSCSVMHE 
1359 1368 1377 1386 1395 1404 

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCT CCG AAG GGG AGC 



1413 1422 1431 1440 1449 1458 

GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG 

GGLK IAAFNIQTFGETKM 



1467 1476 1485 1494 1503 1512 

TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC 

SNAT LVSYIVQILSRYD I 

1521 1530 1539 1548 1557 1566 

GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG 

ALVQ EVRDSHLTAVGKLL 

1575 1584 1593 1602 1611 1620 

GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA 

DNLNQDAPDTYHYVVSE P 

1629 1638 1647 1656 1665 1674 

CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG 

LGRN SYKERYLFVYRPDQ 

1683 1692 1701 1710 1719 1728 

GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC 

VSAV DSYYYDDGCEPCGN 

1737 1746 1755 1764 1773 1782 

GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG 

DTFNRE PAIVRFFSRFTE 

1791 1800 1809 1818 1827 1836 

GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC 

VREF AIVPLHAAPG DAVA 

1845 1854 1863 1872 1881 1890 

GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG 

EIDA LYDVYLDVQEKWG L 

1899 1908 1917 1926 1935 1944 

GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC 

EDVM LMGDFNAGCSYVR P 

1953 1962 1971 1980 1989 1998 

TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC 

SQWS S I RLWTSPTFQWL I 

2007 2016 2025 2034 2043 2052 

CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC 

PDSA DTTATPTHCA YDR I 
2061 2070 2079 2088 2097 2106 

GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC 



VVAGMLLRGAVV PDS AL P 

2115 2124 2133 2142 2151 2160 

TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT 



FNFQAAYGLSDQLAQAIS 

2169 2178 2187 2196 2205 2214 

GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA AAG AAG CGC 
2223 
LOCUS PAS39.DNA 2220 bp 2190 bp DNA 14-AUG-1998 

DEFINITION HUMANISED HMFG1 heavy chain fused to human DNAse - construct 39 
DEFINITION Clone 18.24.1 with residue 1392 T > C +NLS 
REFERENCE 
AUTHORS VERHOEYEN ET AL 
TITLE CONSTRUCTION OF RESHAPED HMFG1 etc 
JOURNAL IMMUNOL. (1993) :78, 364-370 
COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna) 
COMMENT The fusion was made using overlapping oligos AS83 and AS84 
FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 
FEATURES Residue 963 is G > T leading to silent mutation in all clones 
FEATURES Residue 1392 T > C silent S to S mutation 
SITES Note 
BASE COUNT 508 a 684 c 617 g 411 t 
ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CcCCGGGGAG CGGCGGGCTG AAGATCGCAG CCTTCAACAT CCAGACATTT 
1441 GGGGAGACCA AGATGTCCAA TGCCACCCTC GTCAGCTACA TTGTGCAGAT CCTGAGCCGC 
1501 TACGACATCG CCCTGGTCCA GGAGGTCAGA GACAGCCACC TGACTGCCGT GGGGAAGCTG 
1561 CTGGACAACC TCAATCAGGA CGCACCAGAC ACCTATCACT ACGTGGTCAG TGAGCCACTG 
1621 GGACGGAACA GCTATAAGGA GCGCTACCTG TTCGTGTACA GGCCTGACCA GGTGTCTGCG 
1681 GTGGACAGCT ACTACTACGA TGATGGCTGC GAGCCCTGCG GGAACGACAC CTTCAACCGA 
1741 GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG TTCACAGAGG TCAGGGAGTT TGCCATTGTT 
1801 CCCCTGCATG CGGCCCCGGG GGACGCAGTA GCCGAGATCG ACGCTCTCTA TGACGTCTAC 
1861 CTGGATGTCC AAGAGAAATG GGGCTTGGAG GACGTCATGT TGATGGGCGA CTTCAATGCG 
1921 GGCTGCAGCT ATGTGAGACC CTCCCAGTGG TCATCCATCC GCCTGTGGAC AAGCCCCACC 
1981 TTCCAGTGGC TGATCCCCGA CAGCGCTGAC ACCACAGCTA CACCCACGCA CTGTGCCTAT 
2041 GACAGGATCG TGGTTGCAGG GATGCTGCTC CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT 
2101 CCCTTTAACT TCCAGGCTGC CTATGGCCTG AGTGACCAAC TGGCCCAAGC CATCAGTGAC 
2161 CACTATCCAG TGGAGGTGAT GCTGAAGGGG GGCGGACCCA AAAAGAAGCG CAAGGTTTGA 
225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNTAYMELSS LRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

W G Q G T L V T V S S A S T K G P S 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAP SSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAVLQSSG 

603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSLSSVVTVPSSSLGTQ 
657 666 675 684 693 702 
ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

ELLGG PSVFLFPPKPKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

LMISRTPEVTCVVVDVSH 

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

EDPEVKFNWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 



V 



AKTKPREEQYNSTYRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 



VLTVLHQDWLNGKEYKCK 

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 



VSNKALPAPIEKTI SKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 



GQPREPQVYTLPPS RDEL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 



TKNQVS LTCLVKG F Y PSD 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 



IAVEWESNGQPENNYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 



PPVLDSDGSFFLYS KLTV 

1305 1314 1323 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 



DKSRWQQGNVFSCSVMHE 
1359 1368 1377 1386 1395 1404 

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCC CCG GGG AGC GGC 

ALHNHYTQKSLSLSPGSG 



1413 1422 1431 1440 1449 1458 

GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG TCC 



GLK I A A F N IQTFGETKMS 



1467 1476 1485 1494 1503 1512 

AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC GCC 



NATLVSYIVQILSRYDIA 

1521 1530 1539 1548 1557 1566 

CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG GAC 



LVQEVRDSHLTAVGKLLD 

1575 1584 1593 1602 1611 1620 

AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA CTG 



NLNQDAPDTYHYVVSEPL 

1629 1638 1647 1656 1665 1674 

GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG GTG 



GRNSYKERYLFVYRPDQV 

1683 1692 1701 1710 1719 1728 

TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC GAC 



SAVDSYYYDDGCEPCGND 

1737 1746 1755 1764 1773 1782 

ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG GTC 



TFNREPAIVRFFSRFTEV 

1791 1800 1809 1818 1827 1836 

AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC GAG 



REFAIV PLHAAPGDAVAE 

1845 1854 1863 1872 1881 1890 

ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG GAG 



IDALYDVYLDVQEKWG LE 

1899 1908 1917 1926 1935 1944 

GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC TCC 



DVMLMGDFNAGCSYVRPS 

1953 1962 1971 1980 1989 1998 

CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC CCC 



Q W S S IRLWTSPTFQWL I P 

2007 2016 2025 2034 2043 2052 

GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC GTG 



DSADTTATPTHCAYDRIV 
2061 2070 2079 2088 2097 2106 

GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC TTT 

V A G MLLRGAVVPDSAL P F 

2115 2124 2133 2142 2151 2160 

AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT GAC 

NFQAAYGLSDQLAQAISD 

2169 2178 2187 2196 2205 2214 

CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA AAG AAG CGC AAG 

HYPVEVMLKGGGPKKKRK 



GTT TGA 3 ' 
pAS101 

LOCUS PAS101.DNA 1548 bp mRNA PRI 06-MAR-1995 
DEFINITION Humanised HMFG1 Fab'2 fused to human DNase I (pAS101) 
ACCESSION 
NID 
KEYWORDS DNase I. 
SOURCE DNase I sequence is from assembled oligos (thus modified c/f 
MHDNASE1.dna) 
ORGANISM Homo sapiens 
Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
AUTHORS Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L. 
TITLE Recombinant human DNase I reduces the viscosity of cystic fibrosis 
sputum 
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 
MEDLINE 91067672 
BASE COUNT 343 a 467 c 430 g 308 t 
ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAG GCGGGCTGAA GATCGCAGCC 
781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT 
841 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG 
901 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCAGACAC CTATCACTAC 
961 GTGGTCAGTG AGCCACTGGG ACGGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG 
1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC TACTACGATG ATGGCTGCGA GCCCTGCGGG 
1081 AACGACACCT TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT CACAGAGGTC 
1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC 
1201 GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAATGGG GCTTGGAGGA CGTCATGTTG 
1261 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC 
1321 CTGTGGACAA GCCCCACCTT CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA 
1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT 
1441 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG 
1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGAAGTGA 

// 



Fig. 13(A) 
LOCUS FDDNASE101 1548 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join(1..>720,<781..1548) 

/note="1 to 1548 of PAS101.dna [Split]" 

frag 721..780 

/note="1 to 60 of 101/105linker" 

frag join(721..>735,<736..>759,<760..>780) 

/note="1 to 80 of 102linker [Split]" 

BASE COUNT 343 A 465 C 431 G 309 T 0 OTHER 
ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 

61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 

121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 

181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 

241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 

301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 

361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 

421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 

481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 

601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 

661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 

721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAGG GCGGGCTGAA GATCGCAGCC 

781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT 

841 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG 

901 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCAGACAC CTATCACTAC 

961 GTGGTCAGTG AGCCACTGGG ACGGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG 

1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC TACTACGATG ATGGCTGCGA GCCCTGCGGG 

1081 AACGACACCT TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT CACAGAGGTC 

1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC 

1201 GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAATGGG GCTTGGAGGA CGTCATGTTG 

1261 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC 

1321 CTGTGGACAA GCCCCACCTT CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA 

1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT 

1441 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG 

1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGAAGTGA 

// 
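
In each record of this listing the four figures on the BASE COUNT line should add up to the length given on the LOCUS line, which offers a quick consistency check on a transcribed sequence. The following is a minimal sketch of that check; the function names `base_count` and `matches_header` are hypothetical, and the parsing simply assumes that everything in the ORIGIN block other than the letters A, C, G and T (position numbers, spaces) can be ignored.

```python
from collections import Counter


def base_count(origin_text):
    """Count A/C/G/T in ORIGIN text, ignoring position numbers and spacing."""
    return Counter(ch for ch in origin_text.upper() if ch in "ACGT")


def matches_header(origin_text, expected_counts, expected_length):
    """Compare the sequence with a BASE COUNT line (upper-case keys) and a LOCUS length."""
    counts = base_count(origin_text)
    same_counts = all(counts[base] == n for base, n in expected_counts.items())
    return same_counts and sum(counts.values()) == expected_length


# Toy input: the first twenty bases of the record above.
sample = "1 ATGGGATGGA GCTGTATCAT"
print(base_count(sample))
print(matches_header(sample, {"A": 5, "C": 2, "G": 7, "T": 6}, 20))
```

A mismatch does not say where the error lies, only that the transcribed sequence and its header disagree.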



Fig. 13(B) 



LOCUS FDDNASE101 1557 BP SS-DNA SYN 29-AUG-2000 

DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 
FEATURES Location/Qualifiers 
frag 10..1557 
/note="1 to 1548 of FdDNase101correct" 
frag join(10..>729,<790..1557) 
/note="1 to 1548 of PAS101.dna [Split]" 
frag 730..789 
/note="1 to 60 of 101/105linker" 
frag join(730..>744,<745..>768,<769..>789) 
/note="1 to 80 of 102linker [Split]" 

BASE COUNT 344 A 471 C 433 G 309 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGTCCA CCGTGTCCAG CACCAGAGGG CGGGCTGAAG 
781 ATCGCAGCCT TCAACATCCA GACATTTGGG GAGACCAAGA TGTCCAATGC CACCCTCGTC 
841 AGCTACATTG TGCAGATCCT GAGCCGCTAC GACATCGCCC TGGTCCAGGA GGTCAGAGAC 
901 AGCCACCTGA CTGCCGTGGG GAAGCTGCTG GACAACCTCA ATCAGGACGC ACCAGACACC 
961 TATCACTACG TGGTCAGTGA GCCACTGGGA CGGAACAGCT ATAAGGAGCG CTACCTGTTC 
1021 GTGTACAGGC CTGACCAGGT GTCTGCGGTG GACAGCTACT ACTACGATGA TGGCTGCGAG 
1081 CCCTGCGGGA ACGACACCTT CAACCGAGAG CCAGCCATTG TCAGGTTCTT CTCCCGGTTC 
1141 ACAGAGGTCA GGGAGTTTGC CATTGTTCCC CTGCATGCGG CCCCGGGGGA CGCAGTAGCC 
1201 GAGATCGACG CTCTCTATGA CGTCTACCTG GATGTCCAAG AGAAATGGGG CTTGGAGGAC 
1261 GTCATGTTGA TGGGCGACTT CAATGCGGGC TGCAGCTATG TGAGACCCTC CCAGTGGTCA 
1321 TCCATCCGCC TGTGGACAAG CCCCACCTTC CAGTGGCTGA TCCCCGACAG CGCTGACACC 
1381 ACAGCTACAC CCACGCACTG TGCCTATGAC AGGATCGTGG TTGCAGGGAT GCTGCTCCGA 
1441 GGGGCCGTTG TTCCCGACTC GGCTCTTCCC TTTAACTTCC AGGCTGCCTA TGGCCTGAGT 
1501 GACCAACTGG CCCAAGCCAT CAGTGACCAC TATCCAGTGG AGGTGATGCT GAAGTGA 

// 



Fig. 13(C) 



9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 
MGWSCIILFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 






SQVQLVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VKVSCKASGYTFSAYWIE 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAPGKGLEWVGE I L P 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTNTAYMELSSLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WGQGTLVTVSSASTKGPS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAVLQS SG 

Fig. 13(D) 

(Sheet 1 of 3) 






603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSLSSVVTVPSSSLGTQ 



657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPC PAP 

765 774 783 792 801 810 

GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 

EGGLKIAAFNIQTFGETK 

819 828 837 846 855 864 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 

MSNATLVSYIVQILSRYD 

873 882 891 900 909 918 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG 

IALVQEVRDS HLTAVGKL 

927 936 945 954 963 972 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 

LDNLNQDAPDTYHYVVSE 

981 990 999 1008 1017 1026 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 

PLGRNSYKERYLFVYRPD 

1035 1044 1053 1062 1071 1080 

CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG 

QVSAVDSYYYDDGC EPCG 

1089 1098 1107 1116 1125 1134 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA 

NDTFNREPAIVRFFSRFT 

1143 1152 1161 1170 1179 1188 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 

EVREFAIVPLHAAPGDAV 
1197 1206 1215 1224 1233 1242 



Fig. 13(D) 

(Sheet 2 of 3) 
GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 



AEI DAL YDVYLDVQ E KWG 

1251 1260 1269 1278 1287 1296 

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA 



LEDVMLMGDFNAGC S YVR 

1305 1314 1323 1332 1341 1350 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 



PSQWSSIRLWTSPTFQWL 

1359 1368 1377 1386 1395 1404 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 



I PDSAD TTATPT HC A YD R 

1413 1422 1431 1440 1449 1458 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 



IVVAGMLLRGAVVPD SAL 

1467 1476 1485 1494 1503 1512 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 



PFNFQAAYGLSDQLAQAI 

1521 1530 1539 1548 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 



SDHYPVEVMLK 



Fig. 13(D) 
PAS102 

LOCUS PAS102.DNA 1566 bp mRNA PRI 06-MAR-1995 
DEFINITION Humanised HMFG1 Fab'2 fused to human DNase I (pAS102) 
ACCESSION 
NID 
KEYWORDS DNase I. 
SOURCE DNase I sequence is from assembled oligos (thus modified c/f 
MHDNASE1.dna) (See Figure 2) 
ORGANISM Homo sapiens 
Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
AUTHORS Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L. 
TITLE Recombinant human DNase I reduces the viscosity of cystic fibrosis 
sputum 
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 
MEDLINE 91067672 
BASE COUNT 345 a 469 c 440 g 312 t 
ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGGAGCGGC 
781 GGGCTGAAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC 
841 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG 
901 GTCAGAGACA GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA 
961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC 
1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA CTACGATGAT 
1081 GGCTGCGAGC CCTGCGGGAA CGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC 
1141 TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC 
1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC 
1261 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC 
1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC 
1381 GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG 
1441 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCCCT TTAACTTCCA GGCTGCCTAT 
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG 
1561 AAGTGA 

// 



Fig. 14(A) 
LOCUS FDDNASE102 1566 BP SS-DNA SYN 23-MAR-2001 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

BASE COUNT 345 A 468 C 440 G 313 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGGAGCGGC
781 GGGCTGAAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC
841 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG 
901 GTCAGAGACA GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA 
961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC 
1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA CTACGATGAT 
1081 GGCTGCGAGC CCTGCGGGAA CGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC 
1141 TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC 
1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC 
1261 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC
1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC 
1381 GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG 
1441 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCCCT TTAACTTCCA GGCTGCCTAT
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG 
1561 AAGTGA 

// 



Fig. 14(B)
PAS302
LOCUS FDDNASE302 1575 BP SS-DNA SYN 29-AUG-2000

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10..1575

/note="1 to 1566 of FdDNase102correct"
BASE COUNT 346 A 474 C 442 G 313 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG 
781 GGGAGCGGCG GGCTGAAGAT CGCAGCCTTC AACATCCAGA CATTTGGGGA GACCAAGATG 
841 TCCAATGCCA CCCTCGTCAG CTACATTGTG CAGATCCTGA GCCGCTACGA CATCGCCCTG 
901 GTCCAGGAGG TCAGAGACAG CCACCTGACT GCCGTGGGGA AGCTGCTGGA CAACCTCAAT 
961 CAGGACGCAC CAGACACCTA TCACTACGTG GTCAGTGAGC CACTGGGACG GAACAGCTAT 
1021 AAGGAGCGCT ACCTGTTCGT GTACAGGCCT GACCAGGTGT CTGCGGTGGA CAGCTACTAC 
1081 TACGATGATG GCTGCGAGCC CTGCGGGAAC GACACCTTCA ACCGAGAGCC AGCCATTGTC
1141 AGGTTCTTCT CCCGGTTCAC AGAGGTCAGG GAGTTTGCCA TTGTTCCCCT GCATGCGGCC 
1201 CCGGGGGACG CAGTAGCCGA GATCGACGCT CTCTATGACG TCTACCTGGA TGTCCAAGAG 
1261 AAATGGGGCT TGGAGGACGT CATGTTGATG GGCGACTTCA ATGCGGGCTG CAGCTATGTG
1321 AGACCCTCCC AGTGGTCATC CATCCGCCTG TGGACAAGCC CCACCTTCCA GTGGCTGATC 
1381 CCCGACAGCG CTGACACCAC AGCTACACCC ACGCACTGTG CCTATGACAG GATCGTGGTT 
1441 GCAGGGATGC TGCTCCGAGG GGCCGTTGTT CCCGACTCGG CTCTTCCCTT TAACTTCCAG
1501 GCTGCCTATG GCCTGAGTGA CCAACTGGCC CAAGCCATCA GTGACCACTA TCCAGTGGAG 
1561 GTGATGCTGA AGTGA 

// 



Fig. 14(C) 



9 18 27 36 45 54

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC

M G W S C I I L F L V A T A T G V H

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQLVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 






VKVSCKASGYTFSAYW I E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVG E I LP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT

GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG

RDTSTNTAYMELSSLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WGQGTLVTVSSASTKG PS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VF PL A PSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA

GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAVLQSSG

603 612 621 630 639 648

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG



LYSLSSVVTVPSSSLGTQ 

657 666 675 684 693 702

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKVDKK

711 720 729 738 747 756

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 



VEPKSCDKTHTCCVECPP 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG 



CPAPEGSGGLKIAAFNIQ 

819 828 837 846 855 864 

ACA TTT GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG 



TFGETKMSNATLV SYIVQ 

873 882 891 900 909 918 

ATC CTG AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG 



ILSRYDIALVQEVRDSHL

927 936 945 954 963 972 

ACT GCC GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT



TAVGKLLDNLNQDAPDTY

981 990 999 1008 1017 1026 

CAC TAC GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG 



HYVVSEPLGRNSY KERYL 

1035 1044 1053 1062 1071 1080 

TTC GTG TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT 



FVYRPDQVSAVDSYYYDD 

1089 1098 1107 1116 1125 1134 

GGC TGC GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG 



GCEPCGNDTFNRE PAIVR 

1143 1152 1161 1170 1179 1188 

TTC TTC TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG 



FFSRFTEVREFAIVPLHA

1197 1206 1215 1224 1233 1242

GCC CCG GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT



APGDAVAEIDALYDVY LD 

1251 1260 1269 1278 1287 1296 

GTC CAA GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG 



V Q EKWG L EDVMLMG D F NA 

1305 1314 1323 1332 1341 1350

GGC TGC AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC

GCSYVRPSQWSSIRLWTS

1359 1368 1377 1386 1395 1404

CCC ACC TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG



PTFQWL I PDSADTTAT PT 

1413 1422 1431 1440 1449 1458 

CAC TGT GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT

HCAYDRIVVAGMLLRGAV

1467 1476 1485 1494 1503 1512 

GTT CCC GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC 



VPDSALP FNFQAAYGLSD 

1521 1530 1539 1548 1557 1566 

CAA CTG GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 



QLAQAISDHYPVEVMLK
PAS103






LOCUS PAS103.DNA 1560 bp mRNA PRI 06-MAR-1995
DEFINITION Humanised HMFG1 Fab'2 fused to human DNase I (pAS103)
ACCESSION
NID
KEYWORDS DNase I.
SOURCE DNase I sequence is from assembled oligos (thus modified c/f
MHDNASE1.dna)
ORGANISM Homo sapiens
Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
AUTHORS Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L.
TITLE Recombinant human DNase I reduces the viscosity of cystic fibrosis
sputum
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990)
MEDLINE 91067672
BASE COUNT 344 a 468 c 436 g 312 t
ORIGIN
1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
721 GACAAAACTC ACACATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGCGGGCTG
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA
901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA

//



Fig. 15(A) 








LOCUS FDDNASE103 1560 BP SS-DNA SYN 25-AUG-2000
DEFINITION
ACCESSION
KEYWORDS
SOURCE
FEATURES Location/Qualifiers
frag join(1..>720,<793..1560)
/note="1 to 1560 of PAS103.dna [Split]"
frag 721..792
/note="1 to 72 of 103/107linker"
frag join(721..>771,<772..792)
/note="1 to 78 of 102linker [Split]"
BASE COUNT 344 A 467 C 436 G 313 T 0 OTHER
ORIGIN
1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGCGGGCTG
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA
901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA

//



Fig. 15(B) 



LOCUS FDDNASE103 1569 BP SS-DNA SYN 29-AUG-2000
DEFINITION
ACCESSION
KEYWORDS
SOURCE
FEATURES Location/Qualifiers
frag 10..1569
/note="1 to 1560 of FdDNase103correct"
frag join(10..>729,<802..1569)
/note="1 to 1560 of PAS103.dna [Split]"
frag 730..801
/note="1 to 72 of 103/107linker"
frag join(730..>780,<781..801)
/note="1 to 78 of 102linker [Split]"
BASE COUNT 345 A 473 C 438 G 313 T 0 OTHER
ORIGIN
1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC
721 AAATCTTGTG ACAAAACTCA CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG
781 GGCGGGCTGA AGATCGCAGC CTTCAACATC CAGACATTTG GGGAGACCAA GATGTCCAAT
841 GCCACCCTCG TCAGCTACAT TGTGCAGATC CTGAGCCGCT ACGACATCGC CCTGGTCCAG
901 GAGGTCAGAG ACAGCCACCT GACTGCCGTG GGGAAGCTGC TGGACAACCT CAATCAGGAC
961 GCACCAGACA CCTATCACTA CGTGGTCAGT GAGCCACTGG GACGGAACAG CTATAAGGAG
1021 CGCTACCTGT TCGTGTACAG GCCTGACCAG GTGTCTGCGG TGGACAGCTA CTACTACGAT
1081 GATGGCTGCG AGCCCTGCGG GAACGACACC TTCAACCGAG AGCCAGCCAT TGTCAGGTTC
1141 TTCTCCCGGT TCACAGAGGT CAGGGAGTTT GCCATTGTTC CCCTGCATGC GGCCCCGGGG
1201 GACGCAGTAG CCGAGATCGA CGCTCTCTAT GACGTCTACC TGGATGTCCA AGAGAAATGG
1261 GGCTTGGAGG ACGTCATGTT GATGGGCGAC TTCAATGCGG GCTGCAGCTA TGTGAGACCC
1321 TCCCAGTGGT CATCCATCCG CCTGTGGACA AGCCCCACCT TCCAGTGGCT GATCCCCGAC
1381 AGCGCTGACA CCACAGCTAC ACCCACGCAC TGTGCCTATG ACAGGATCGT GGTTGCAGGG
1441 ATGCTGCTCC GAGGGGCCGT TGTTCCCGAC TCGGCTCTTC CCTTTAACTT CCAGGCTGCC
1501 TATGGCCTGA GTGACCAACT GGCCCAAGCC ATCAGTGACC ACTATCCAGT GGAGGTGATG
1561 CTGAAGTGA

//



Fig. 15(C) 



9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC

M G W S C I ILFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQLVQSGAEVKK PGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSCKASGYTFSAYW I E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVGEILP

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT

GSNNSRYNEKFKGRVTVT

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG

RDTSTNTAYMELSS LRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAW FAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WGQGTLVTVSSAST KG PS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLQSSG

603 612 621 630 639 648

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG

LYSLSSVVTVPSSSLGTQ

657 666 675 684 693 702

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA

TYICNVNHKPSNTKVDKK

711 720 729 738 747 756

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG

VEPKSCDKTHTCCVECPP

765 774 783 792 801 810

TGC CCA GCA CCT GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT

CPAPEGGLKIAAFNIQTF

819 828 837 846 855 864

GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG

GETKMSNATLVSYIVQIL

873 882 891 900 909 918

AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC

SRYDIALVQEVRDSHLTA

927 936 945 954 963 972

GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC

VGKLLDNLNQDAPDTYHY

981 990 999 1008 1017 1026

GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG

VVSEPLGRNSYKERYLFV

1035 1044 1053 1062 1071 1080

TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC

YRPDQVSAVDSYYYDDGC

1089 1098 1107 1116 1125 1134

GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC

EPCGNDTFNREPAIVRFF

1143 1152 1161 1170 1179 1188

TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG

SRFTEVREFAIVPLHAAP

1197 1206 1215 1224 1233 1242

GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA

GDAVAEIDALYDVYLDVQ

1251 1260 1269 1278 1287 1296

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC

EKWGLEDVMLMGDFNAGC

1305 1314 1323 1332 1341 1350

AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC

SYVRPSQWSSIRLWTSPT

1359 1368 1377 1386 1395 1404

TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT 



FQWLI PDSADTTAT PTHC 

1413 1422 1431 1440 1449 1458 

GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC 



AY DR IVVAGMLLRGAVV P 

1467 1476 1485 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 



DSAL PFNFQAAYG LSDQL 

1521 1530 1539 1548 1557 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 



AQAISDHYPVEVMLK
PAS104



LOCUS PAS104.DNA 1560 bp mRNA PRI 06-MAR-1995

DEFINITION Humanised HMFG1 Fab'2 fused to human DNase I (pAS104)

Position 924 G to A by ggg to gag 

Linker GR instead of GG (position 777) 

ACCESSION 

NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASE1.dna)

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
AUTHORS Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L.
TITLE Recombinant human DNase I reduces the viscosity of cystic fibrosis
sputum
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990)

MEDLINE 91067672 
BASE COUNT 346 a 468 c 434 g 312 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
721 GACAAAACTC ACACATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGCAGGCTG
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA
901 GACAGCCACC TGACTGCCGT GGAGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA

// 



Fig. 16(A)
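
The two changes that the PAS104 DEFINITION notes relative to pAS103 (ggg to gag at position 924, and a GR rather than GG linker at position 777) can be checked by reading the codon that ends at each of those positions. The sketch below is illustrative only and is not part of the listing: the function name `codon_ending_at` is hypothetical, the codon table is deliberately limited to the four codons involved, and the row strings are copied from the rows numbered 721 and 901 of the PAS103.DNA and PAS104.DNA records.

```python
# Partial standard codon table: only the codons this check needs.
CODONS = {"GGC": "G", "GGG": "G", "GAG": "E", "AGG": "R"}


def codon_ending_at(row: str, row_start: int, pos: int) -> str:
    """Return the codon whose last base sits at 1-based sequence position `pos`.

    `row` is one ORIGIN row (spaces between blocks are allowed) and
    `row_start` is the number printed at the start of that row.
    """
    bases = row.replace(" ", "").upper()
    end = pos - row_start + 1  # 1-based offset of the codon's last base in the row
    if end < 3 or end > len(bases):
        raise ValueError("codon is not fully contained in this row")
    return bases[end - 3:end]


# Rows copied from the listings (row numbers 721 and 901).
PAS103_721 = "GACAAAACTC ACACATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGCGGGCTG"
PAS104_721 = "GACAAAACTC ACACATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGCAGGCTG"
PAS103_901 = "GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC"
PAS104_901 = "GACAGCCACC TGACTGCCGT GGAGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC"

for name, row721, row901 in (("pAS103", PAS103_721, PAS103_901),
                             ("pAS104", PAS104_721, PAS104_901)):
    # Linker: the two codons ending at positions 774 and 777.
    linker = "".join(CODONS[codon_ending_at(row721, 721, p)] for p in (774, 777))
    # Point change: the codon ending at position 924.
    codon924 = codon_ending_at(row901, 901, 924)
    print(name, "linker:", linker, "codon at 924:", codon924, "=", CODONS[codon924])
```

Reading codons by the position of their last base matches the numbering used in the codon/translation figures, where each printed number marks the final base of a codon.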



LOCUS FDDNASE104 1560 BP SS-DNA SYN 25-AUG-2000
DEFINITION
ACCESSION
KEYWORDS
SOURCE
FEATURES Location/Qualifiers
frag join(1..>720,<793..1560)
/note="1 to 1560 of PAS104.dna [Split]"
frag 721..792
/note="1 to 72 of 104linker"
frag join(721..>774,<776..792)
/note="1 to 72 of 103linker [Split]"
frag join(721..>771,<772..>774,<776..792)
/note="1 to 78 of 102linker [Split]"
BASE COUNT 346 A 467 C 434 G 313 T 0 OTHER
ORIGIN
1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGCAGGCTG
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA
901 GACAGCCACC TGACTGCCGT GGAGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA

//



Fig. 16(B)



9 18 27 36 45 54

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC

M G W S C I I L F L V A T A T G V H

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQLVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSCKASGYTFSAYWIE 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT



GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA 
GDAVAEIDALYDVYLDVQ

1251 1260 1269 1278 1287 1296 

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC 

EKWGLEDVMLMGDFNAGC 

1305 1314 1323 1332 1341 1350 

AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC 

SYVRPSQWSSIRLWTSPT 

1359 1368 1377 1386 1395 1404 

TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT 

FQWLI PDSADTTAT PTHC 

1413 1422 1431 1440 1449 1458 

GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC

AYDRIVVAGMLLRGAVVP 

1467 1476 1485 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 

DSALPFNFQAAYGLSDQL

1521 1530 1539 1548 1557 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

AQAISDHYPVEVMLK *
PAS105






LOCUS PAS105.DNA 1578 bp mRNA PRI 06-MAR-1995
DEFINITION Humanised HMFG1 Fab'2 fused to human DNase I with SV40
NLS (pAS105)
ACCESSION
NID
KEYWORDS DNase I.
SOURCE DNase I sequence is from assembled oligos (thus modified c/f
MHDNASE1.dna)
ORGANISM Homo sapiens
Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
AUTHORS Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L.
TITLE Recombinant human DNase I reduces the viscosity of cystic fibrosis
sputum
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990)
MEDLINE 91067672
BASE COUNT 353 a 473 c 442 g 310 t
ORIGIN
1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAG GCGGGCTGAA GATCGCAGCC
781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT
841 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG
901 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCAGACAC CTATCACTAC
961 GTGGTCAGTG AGCCACTGGG ACGGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG
1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC TACTACGATG ATGGCTGCGA GCCCTGCGGG
1081 AACGACACCT TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT CACAGAGGTC
1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC
1201 GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAATGGG GCTTGGAGGA CGTCATGTTG
1261 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC
1321 CTGTGGACAA GCCCCACCTT CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA
1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT
1441 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG
1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGAAGGGGGG CGGACCCAAA
1561 AAGAAGCGCA AGGTTTGA

>NLS

//



Fig. 17(A)



LOCUS FDDNASE105 1578 BP SS-DNA SYN 25-AUG-2000
DEFINITION
ACCESSION
KEYWORDS
SOURCE
FEATURES Location/Qualifiers
frag join(1..>720,<781..1578)
/note="1 to 1578 of PAS105.dna [Split]"
frag 721..780
/note="1 to 60 of 101/105linker"
frag join(721..>735,<736..>759,<760..>780)
/note="1 to 80 of 102linker [Split]"
BASE COUNT 353 A 471 C 443 G 311 T 0 OTHER
ORIGIN
1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAGG GCGGGCTGAA GATCGCAGCC
781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT
841 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG
901 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCAGACAC CTATCACTAC
961 GTGGTCAGTG AGCCACTGGG ACGGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG
1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC TACTACGATG ATGGCTGCGA GCCCTGCGGG
1081 AACGACACCT TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT CACAGAGGTC
1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC
1201 GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAATGGG GCTTGGAGGA CGTCATGTTG
1261 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC
1321 CTGTGGACAA GCCCCACCTT CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA
1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT
1441 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG
1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGAAGGGGGG CGGACCCAAA
1561 AAGAAGCGCA AGGTTTGA

//



Fig. 17(B)






LOCUS       FDDNASE105   1587 BP SS-DNA   SYN   29-AUG-2000
DEFINITION
ACCESSION
KEYWORDS
SOURCE
FEATURES    Location/Qualifiers
   frag     10..1587
            /note="1 to 1578 of FdDNase105correct"
   frag     join(10..>729,<790..1587)
            /note="1 to 1578 of PAS105.dna [Split]"
   frag     730..789
            /note="1 to 60 of 101/105linker"
   frag     join(730..>744,<745..>768,<769..>789)
            /note="1 to 80 of 102linker [Split]"
BASE COUNT  354 A  477 C  445 G  311 T  0 OTHER
ORIGIN

   1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC
  61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG
 121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC
 181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT
 241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA
 301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA
 361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA
 421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG
 481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG
 541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA
 601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC
 661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC
 721 AAATCTTGTG ACAAAACTCA CACATGTCCA CCGTGTCCAG CACCAGAGGG CGGGCTGAAG
 781 ATCGCAGCCT TCAACATCCA GACATTTGGG GAGACCAAGA TGTCCAATGC CACCCTCGTC
 841 AGCTACATTG TGCAGATCCT GAGCCGCTAC GACATCGCCC TGGTCCAGGA GGTCAGAGAC
 901 AGCCACCTGA CTGCCGTGGG GAAGCTGCTG GACAACCTCA ATCAGGACGC ACCAGACACC
 961 TATCACTACG TGGTCAGTGA GCCACTGGGA CGGAACAGCT ATAAGGAGCG CTACCTGTTC
1021 GTGTACAGGC CTGACCAGGT GTCTGCGGTG GACAGCTACT ACTACGATGA TGGCTGCGAG
1081 CCCTGCGGGA ACGACACCTT CAACCGAGAG CCAGCCATTG TCAGGTTCTT CTCCCGGTTC
1141 ACAGAGGTCA GGGAGTTTGC CATTGTTCCC CTGCATGCGG CCCCGGGGGA CGCAGTAGCC
1201 GAGATCGACG CTCTCTATGA CGTCTACCTG GATGTCCAAG AGAAATGGGG CTTGGAGGAC
1261 GTCATGTTGA TGGGCGACTT CAATGCGGGC TGCAGCTATG TGAGACCCTC CCAGTGGTCA
1321 TCCATCCGCC TGTGGACAAG CCCCACCTTC CAGTGGCTGA TCCCCGACAG CGCTGACACC
1381 ACAGCTACAC CCACGCACTG TGCCTATGAC AGGATCGTGG TTGCAGGGAT GCTGCTCCGA
1441 GGGGCCGTTG TTCCCGACTC GGCTCTTCCC TTTAACTTCC AGGCTGCCTA TGGCCTGAGT
1501 GACCAACTGG CCCAAGCCAT CAGTGACCAC TATCCAGTGG AGGTGATGCT GAAGGGGGGC
1561 GGACCCAAAA AGAAGCGCAA GGTTTGA

//



Fig. 17(C)

9 18 27 36 45 54

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC

M G W S C I I L F L V A T A T G V H

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQLVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSC KASGYTFSAYWI E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 






WVRQA PGKGLEWVGEI LP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYNEKFKG RVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTNTAYMELS SLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFAWFAY 



387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WGQGTLVTVS SASTKG PS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPE PVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

G A L T S G V H T F P A V L Q S S G 

Fig. 17(D) 

(Sheet 1 of 3) 



603 612 621 630 639 648

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSLSSVVTVPSSSLGTQ 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPC PAP 

765 774 783 792 801 810 

GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 

EGGLKIAAFNIQTFGETK 

819 828 837 846 855 864 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 

MSNATLVSYIVQILSRYD 

873 882 891 900 909 918 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG 

IALVQEVRDSHLTAVGKL 

927 936 945 954 963 972

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 

LDNLNQDAPDTYHYVVSE 

981 990 999 1008 1017 1026

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 

PLGRNSYKERYLFVYRPD 

1035 1044 1053 1062 1071 1080 

CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG

QVSAVDSYYYDDGC EPCG 

1089 1098 1107 1116 1125 1134 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA 

NDTFNREPAIVRFFSRFT 

1143 1152 1161 1170 1179 1188

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 

EVREFAIVPLHAA PGDAV 
1197 1206 1215 1224 1233 1242

Fig. 17(D)


GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 



A E I D A L Y DVYLDV Q E K WG 

1251 1260 1269 1278 1287 1296

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA



LEDVMLMGDFNAGC SYVR 

1305 1314 1323 1332 1341 1350 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG



PSQWSSIRLWTSPTFQWL 

1359 1368 1377 1386 1395 1404 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG



IPDSADTTATPTHCAYDR 

1413 1422 1431 1440 1449 1458 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT



IVVAGMLLRGAVVPDSAL 

1467 1476 1485 1494 1503 1512 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 



PFNFQAAYGLSDQLAQAI 

1521 1530 1539 1548 1557 1566 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA AAG AAG



SDHY PVEVMLKGGGPKKK 
1575 

CGC AAG GTT TGA 3 ' 



R K V 



Fig. 17(D)
PAS106

LOCUS       PAS106.DNA   1596 bp   mRNA   PRI   06-MAR-1995
DEFINITION  Humanised HMFG1 Fab'2 fused to human DNase I with SV40
            NLS (pAS106)
ACCESSION
NID
KEYWORDS    DNase I.
SOURCE      DNase I sequence is from assembled oligos (thus modified c/f
            MHDNASE1.dna)
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
  AUTHORS   Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L.
  TITLE     Recombinant human DNase I reduces the viscosity of cystic
            fibrosis sputum
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990)
  MEDLINE   91067672
BASE COUNT  355 a  475 c  452 g  314 t
ORIGIN

   1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG
  61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC
 121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA
 181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT
 241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG
 301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
 361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
 421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
 481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
 541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
 601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
 661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
 721 GACAAAACTC ACACATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGGAGCGGC
 781 GGGCTGAAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC
 841 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG
 901 GTCAGAGACA GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA
 961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC
1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA CTACGATGAT
1081 GGCTGCGAGC CCTGCGGGAA CGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC
1141 TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC
1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC
1261 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC
1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC
1381 GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG
1441 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCCCT TTAACTTCCA GGCTGCCTAT
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG
1561 AAGGGGGGCG GACCCAAAAA GAAGCGCAAG GTTTGA

//



NLS 



Fig. 18(A) 






LOCUS       FDDNASE106   1596 BP SS-DNA   SYN   25-AUG-2000
DEFINITION
ACCESSION
KEYWORDS
SOURCE
FEATURES    Location/Qualifiers
   frag     join(1..>720,<799..1596)
            /note="1 to 1596 of PAS106.dna [Split]"
   frag     721..798
            /note="1 to 78 of 102/106linker"
BASE COUNT  355 A  474 C  452 G  315 T  0 OTHER
ORIGIN

   1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG
  61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC
 121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA
 181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT
 241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG
 301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
 361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
 421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
 481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
 541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
 601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
 661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
 721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGGAGCGGC
 781 GGGCTGAAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC
 841 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG
 901 GTCAGAGACA GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA
 961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC
1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA CTACGATGAT
1081 GGCTGCGAGC CCTGCGGGAA CGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC
1141 TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC
1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC
1261 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC
1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC
1381 GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG
1441 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCCCT TTAACTTCCA GGCTGCCTAT
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG
1561 AAGGGGGGCG GACCCAAAAA GAAGCGCAAG GTTTGA

//



Fig. 18(B) 






LOCUS       FDDNASE106   1605 BP SS-DNA   SYN   29-AUG-2000
DEFINITION
ACCESSION
KEYWORDS
SOURCE
FEATURES    Location/Qualifiers
   frag     10..1605
            /note="1 to 1596 of FdDNase106correct"
   frag     join(10..>729,<808..1605)
            /note="1 to 1596 of PAS106.dna [Split]"
   frag     730..807
            /note="1 to 78 of 102/106linker"
BASE COUNT  356 A  480 C  454 G  315 T  0 OTHER
ORIGIN

   1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC
  61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG
 121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC
 181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT
 241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA
 301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA
 361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA
 421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG
 481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG
 541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA
 601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC
 661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC
 721 AAATCTTGTG ACAAAACTCA CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG
 781 GGGAGCGGCG GGCTGAAGAT CGCAGCCTTC AACATCCAGA CATTTGGGGA GACCAAGATG
 841 TCCAATGCCA CCCTCGTCAG CTACATTGTG CAGATCCTGA GCCGCTACGA CATCGCCCTG
 901 GTCCAGGAGG TCAGAGACAG CCACCTGACT GCCGTGGGGA AGCTGCTGGA CAACCTCAAT
 961 CAGGACGCAC CAGACACCTA TCACTACGTG GTCAGTGAGC CACTGGGACG GAACAGCTAT
1021 AAGGAGCGCT ACCTGTTCGT GTACAGGCCT GACCAGGTGT CTGCGGTGGA CAGCTACTAC
1081 TACGATGATG GCTGCGAGCC CTGCGGGAAC GACACCTTCA ACCGAGAGCC AGCCATTGTC
1141 AGGTTCTTCT CCCGGTTCAC AGAGGTCAGG GAGTTTGCCA TTGTTCCCCT GCATGCGGCC
1201 CCGGGGGACG CAGTAGCCGA GATCGACGCT CTCTATGACG TCTACCTGGA TGTCCAAGAG
1261 AAATGGGGCT TGGAGGACGT CATGTTGATG GGCGACTTCA ATGCGGGCTG CAGCTATGTG
1321 AGACCCTCCC AGTGGTCATC CATCCGCCTG TGGACAAGCC CCACCTTCCA GTGGCTGATC
1381 CCCGACAGCG CTGACACCAC AGCTACACCC ACGCACTGTG CCTATGACAG GATCGTGGTT
1441 GCAGGGATGC TGCTCCGAGG GGCCGTTGTT CCCGACTCGG CTCTTCCCTT TAACTTCCAG
1501 GCTGCCTATG GCCTGAGTGA CCAACTGGCC CAAGCCATCA GTGACCACTA TCCAGTGGAG
1561 GTGATGCTGA AGGGGGGCGG ACCCAAAAAG AAGCGCAAGG TTTGA

ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC

M G W S C I I L F L V A T A T G V H

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQLVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VKVSCKASGYTFS AYWI E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAPGKGLEWVG EILP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



GSNNSRYNEKFKGRVTVT

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTNTAYMELS SLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WGQGTLVTVSSASTKGPS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAVLQSSG 
603 612 621 630 639 648

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG

LYSLSSVVTVPSSSLGTQ

657 666 675 684 693 702

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA

TYICNVNHKPSNTKVDKK

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG



VEPKSCDKTHTCCVECPP 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG



CPAPEG SGGLKI AAFNIQ 

819 828 837 846 855 864 

ACA TTT GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG

TFGETKMSNATLVSYIVQ

873 882 891 900 909 918

ATC CTG AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG



ILSRYDIALVQEVRDSHL 

927 936 945 954 963 972

ACT GCC GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT 



TAVGKLLDNLNQDAPDTY 

981 990 999 1008 1017 1026

CAC TAC GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG 



HYVVSE PLGRNSYKERYL 

1035 1044 1053 1062 1071 1080 

TTC GTG TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT 



FVYRPDQVSAVDSYYYDD

1089 1098 1107 1116 1125 1134 

GGC TGC GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG 



GCEPCGNDTFNRE PA I V R 

1143 1152 1161 1170 1179 1188 

TTC TTC TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG 



F F S R FT E V R E F A I V P L H A 
1197 1206 1215 1224 1233 1242 
GCC CCG GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT

APGDAVAEIDALYDVYLD

1251 1260 1269 1278 1287 1296 

GTC CAA GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG

VQEKWG LEDVMLMGDFNA 

1305 1314 1323 1332 1341 1350 

GGC TGC AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC

GCSYVR PSQW SS I RLWTS 

1359 1368 1377 1386 1395 1404 

CCC ACC TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG

PTFQWL I PDSADTTATPT 

1413 1422 1431 1440 1449 1458 

CAC TGT GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT 

HCAYD R I V V A G M L L RGAV 

1467 1476 1485 1494 1503 1512 

GTT CCC GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC 

VPDSAL PFNFQAAYGLSD 

1521 1530 1539 1548 1557 1566 

CAA CTG GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG 

QLAQA I SDHYPVEVMLKG 

1575 1584 1593 

GGC GGA CCC AAA AAG AAG CGC AAG GTT TGA 3 ' 



GGPKKKRKV 
PAS107

LOCUS       PAS107.DNA   1590 bp   mRNA   PRI   06-MAR-1995
DEFINITION  Humanised HMFG1 Fab'2 fused to human DNase I with SV40
            NLS (pAS107)
ACCESSION
NID
KEYWORDS    DNase I.
SOURCE      DNase I sequence is from assembled oligos (thus modified c/f
            MHDNASE1.dna)
  ORGANISM  Homo sapiens
            Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata;
            Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo.
  AUTHORS   Shak,S., Capon,D.J., Hellmiss,R., Marsters,S.A. and Baker,C.L.
  TITLE     Recombinant human DNase I reduces the viscosity of cystic
            fibrosis sputum
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990)
  MEDLINE   91067672
BASE COUNT  354 a  474 c  448 g  314 t
ORIGIN

   1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG
  61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC
 121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA
 181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT
 241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG
 301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
 361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
 421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
 481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
 541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
 601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
 661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
 721 GACAAAACTC ACACATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGCGGGCTG
 781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC
 841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA
 901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC
 961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGGGG
1561 GGCGGACCCA AAAAGAAGCG CAAGGTTTGA

//
LOCUS       FDDNASE107   1590 BP SS-DNA   SYN   25-AUG-2000
DEFINITION
ACCESSION
KEYWORDS
SOURCE
FEATURES    Location/Qualifiers
   frag     join(1..>720,<793..1590)
            /note="1 to 1590 of PAS107.dna [Split]"
   frag     721..792
            /note="1 to 72 of 103/107linker"
   frag     join(721..>771,<772..792)
            /note="1 to 78 of 102linker [Split]"
BASE COUNT  354 A  473 C  448 G  315 T  0 OTHER
ORIGIN

   1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG
  61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC
 121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA
 181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT
 241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG
 301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC
 361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC
 421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG
 481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA
 541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC
 601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC
 661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT
 721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGCGGGCTG
 781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC
 841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA
 901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC
 961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGGGG
1561 GGCGGACCCA AAAAGAAGCG CAAGGTTTGA

//
LOCUS       FDDNASE107   1599 BP SS-DNA   SYN   29-AUG-2000
DEFINITION
ACCESSION
KEYWORDS
SOURCE
FEATURES    Location/Qualifiers
   frag     10..1599
            /note="1 to 1590 of FdDNase107correct"
   frag     join(10..>729,<802..1599)
            /note="1 to 1590 of PAS107.dna [Split]"
   frag     730..801
            /note="1 to 72 of 103/107linker"
   frag     join(730..>780,<781..801)
            /note="1 to 78 of 102linker [Split]"
BASE COUNT  355 A  479 C  450 G  315 T  0 OTHER
ORIGIN

   1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC
  61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG
 121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC
 181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT
 241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA
 301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA
 361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA
 421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG
 481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG
 541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA
 601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC
 661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC
 721 AAATCTTGTG ACAAAACTCA CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG
 781 GGCGGGCTGA AGATCGCAGC CTTCAACATC CAGACATTTG GGGAGACCAA GATGTCCAAT
 841 GCCACCCTCG TCAGCTACAT TGTGCAGATC CTGAGCCGCT ACGACATCGC CCTGGTCCAG
 901 GAGGTCAGAG ACAGCCACCT GACTGCCGTG GGGAAGCTGC TGGACAACCT CAATCAGGAC
 961 GCACCAGACA CCTATCACTA CGTGGTCAGT GAGCCACTGG GACGGAACAG CTATAAGGAG
1021 CGCTACCTGT TCGTGTACAG GCCTGACCAG GTGTCTGCGG TGGACAGCTA CTACTACGAT
1081 GATGGCTGCG AGCCCTGCGG GAACGACACC TTCAACCGAG AGCCAGCCAT TGTCAGGTTC
1141 TTCTCCCGGT TCACAGAGGT CAGGGAGTTT GCCATTGTTC CCCTGCATGC GGCCCCGGGG
1201 GACGCAGTAG CCGAGATCGA CGCTCTCTAT GACGTCTACC TGGATGTCCA AGAGAAATGG
1261 GGCTTGGAGG ACGTCATGTT GATGGGCGAC TTCAATGCGG GCTGCAGCTA TGTGAGACCC
1321 TCCCAGTGGT CATCCATCCG CCTGTGGACA AGCCCCACCT TCCAGTGGCT GATCCCCGAC
1381 AGCGCTGACA CCACAGCTAC ACCCACGCAC TGTGCCTATG ACAGGATCGT GGTTGCAGGG
1441 ATGCTGCTCC GAGGGGCCGT TGTTCCCGAC TCGGCTCTTC CCTTTAACTT CCAGGCTGCC
1501 TATGGCCTGA GTGACCAACT GGCCCAAGCC ATCAGTGACC ACTATCCAGT GGAGGTGATG
1561 CTGAAGGGGG GCGGACCCAA AAAGAAGCGC AAGGTTTGA

//



Fig. 19(C)

9 18 27 36 45 54

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC

M G W S C I I L F L V A T A T G V H

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQLVQSGAEVKKPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VKVSCKASGYTF S AYWI E 

171 180 189 198 207 216

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQAPGKGLEWVGEILP

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



GSNNS RYNEKFKG RVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTNTAYMELS SLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



WGQGTLVTVSSASTKGPS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



G A L T S G V H T F P A V L Q S S G

603 612 621 630 639 648

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG



LYSLSSVVTVPSSSLGTQ 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA



TYICNVNHKPSNTKVDKK

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG



VEPKSCDKTHTCCVEC PP 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT 



CPAPEGGLKIAAFNIQTF 

819 828 837 846 855 864 

GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG 



GETKMSNATLVSYIVQIL

873 882 891 900 909 918

AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC



SRYDIALVQEVRDSHLTA 

927 936 945 954 963 972 

GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC 



VGKLLDNLNQDAPDTYHY 

981 990 999 1008 1017 1026 

GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG 



VVS EPLGRNSYKERYLFV 

1035 1044 1053 1062 1071 1080 

TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC 



YRPDQVSAVDSYYYDDGC 

1089 1098 1107 1116 1125 1134 

GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC 



EPCGNDTFNREPAIVRFF

1143 1152 1161 1170 1179 1188

TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG



SRFTEVREFAIVPLHAAP

1197 1206 1215 1224 1233 1242 
GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA



GDAVAEIDALYDVYLDVQ 

1251 1260 1269 1278 1287 1296 

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC 



EKWGL EDVMLMG DFNAGC 

1305 1314 1323 1332 1341 1350 

AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC



SYVRPSQWSSIRLWTSPT 

1359 1368 1377 1386 1395 1404 

TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT



FQWLI PDSADTTAT PTHC 

1413 1422 1431 1440 1449 1458 

GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC 



AYDR IVVAGMLLRGAVV P 

1467 1476 1485 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 



DSALPFNFQAAYGLSDQL 

1521 1530 1539 1548 1557 1566 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA 



AQAI S DHYPVEVMLKGGG 

1575 1584 
CCC AAA AAG AAG CGC AAG GTT TGA 3' 
Mammalian expression of humanised HMFG1-DNase constructs
Fig. 31(A)
Fig. 33(A)
Fig. 34(A)
Fig. 35(A)